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INTRODUCTION 


Despite  improvements  in  medical  treatments  over  the  past  three  decades,  prostate  cancer  remains 
the  second  most  common  cause  of  cancer  death  among  U.S.  men.  According  to  National  Cancer 
Institute,  in  2012  ~24 1,000  prostate  cancer  diagnosis  and  ~28,000  related  deaths  of  American 
men  are  estimated.  Androgen  deprivation,  surgery,  and/or  radiotherapy  in  combination  with 
chemotherapy  has  proven  to  be  effective  in  treating  patients  that  display  localized  disease; 
however,  progression  to  hormone  refractory  aggressive  disease  in  a  subset  of  prostate  cancer 
patients  remains  the  primary  cause  of  mortality.  The  molecular  mechanisms  that  contribute  to  the 
progression  of  localized  disease  into  an  aggressive  disease  remain  largely  unknown.  Long  non¬ 
coding  RNAs  (IncRNAs),  have  recently  emerged  as  key  players  in  tumor  biology  and  can 
potentially  serve  as  promising  biomarkers.  However,  the  vast  majority  of  IncRNAs  remain 
undiscovered  or  uncharacterized  entities  and  their  roles  in  prostate  cancer  are  unclear.  Recent 
advances  by  our  lab  using  high-throughput  sequencing  of  prostate  cancer  tissues  have  led  to  the 
systematic  identification  of  IncRNAs  aberrantly  expressed  in  prostate  cancer  that  may  play  a  role 
in  prostate  cancer  progression.  The  two  main  objectives  of  this  proposal  were: 

1)  To  evaluate  the  role  of  prostate  IncRNAs  in  the  progression  of  prostate  cancer  and, 

2)  To  explore  the  potential  of  prostate-specific  IncRNAs  to  reliably  predict  disease  progression. 


1.  KEYWORDS 

IncRNA,  AR,  PC  AT,  SCHLAPl,  DHT,  RNA 

2.  ACCOMPLISHMENTS 

What  were  the  major  goals  of  the  project? 

Following  three  aims  were  proposed. 

Specific  Aim-1:  To  characterize  nominated  IncRNAs  associated  with  metastatic  castration- 
resistant  prostate  cancer.  (Month  1-18) 

This  specific  aim  is  100%  completed 

Specific  Aim  2:  To  elucidate  the  function  of  IncRNA  in  metastatic  castrate  resistant  prostate 
cancer:  (Month  6-24) 

This  aim  is  90%  completed 

Specific  Aim  3:  To  identify  IncRNAs  that  serve  as  potential  biomarkers  of  disease 
progression.  (Month  12-24) 

This  aim  is  50%  completed 

What  was  accomplished  under  these  goals? 
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Various  tasks  proposed  in  individual  specific  aims  are  written  below.  Results  obtained  under 
each  task  are  shown  in  form  of  figures. 

Specific  Aim-1:  To  characterize  nominated  IncRNAs  associated  with  metastatic  castration- 
resistant  prostate  cancer.  (Month  1-18): 

Task  1:  To  validate  predicted  transcripts  in  prostate  cancer  cell  lines  by  qPCR  (Month  1-3) 

Progress  (100%  completed) 

We  validated  several  IncRNAs  including:  PCAT29,  SCHLAP,  G5303  and  PCAT51.  The 
expression  of  each  IncRNA  in  prostate  cell  lines  was  determined  by  qPCR.  Expression  of 
PCAT29  and  SCHLAP- 1  has  been  published  (see  appendix) 
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Figure  1:  Expression  of  each  IncRNA  in  prostate  cancer  cell  lines.  Inset  shows  the  expression  of  PCAT51 

determined  by  RNA-seq  in  prostate  cancer  cell  lines. 


Task  2:  To  determine  exon  structure  by  rapid  amplification  of  cDNA  ends  (RACE)  (Month  1-6): 

Progress  (100%  completed) 

We  choose  PCAT29,  SCHLAP  and  PCAT51  for  further  study.  We  performed  RACE  on  various 
cell  lines  to  determine  the  exon  structure  of  each  transcript.  Show  below  is  exon  structure  of 
PCAT51  as  determined  by  RACE.  Further,  Northern  blotting  was  performed  to  confirm  the  size 
of  PCAT51.  Exon  structure  of  PCAT29  and  SCHLAP  are  published  (see  appendix) 
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Figure  2:  Genome  browser  view  of  PC  ATS  1.  Exon  strueture  determined  by  RACE  as  well  as  other  gene 
annotation  is  shown.  Inset  shows  PC  ATS  1  northern  blot. 


Task  3:  To  construct  siRNA  for  the  most  conserved  isoform  and  screen  for  IncRNA  knockdown 
efficiency  of  by  qPCR  (Month  6-9) 

Progress  (100%  completed) 

We  made  4  siRNA  targeting  PCATSl.  These  siRNA  were  tested  in  LNCaP  and  MDA_PCa_2b 
cells.  In  both  the  cell  lines  we  got  80%  knockdown  of  the  gene.  Gene  expression  was  quantified 
by  qRT-PCR.  siRNA  for  PCAT29  and  SCHLAP  are  now  published  (see  appendix) 


MDA_PCa_2B  LNCaP 

Figure  3:  MDA_PCa_2B  and  LNCaP  cells  were  transfected  with  4  independent  siRNAs  targeting 
PCATSl.  Expression  of  PCATSl  was  determined  by  q-RT-PCR 

Task  4:  To  generate  stable  knockdown  cell  lines  using  shRNA  constructs  (Month  6-9): 

Progress  (100%  completed) 

We  made  shRNA  constructs  targeting  PCATSl,  PCAT29  and  SCHLAP  1.  Two  best  siRNA 
sequences  targeting  the  genes  were  used.  shRNA  knockdown  data  for  PCAT29  and  SCHLAP  1  is 
published  (See  appendix). 
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Task  5:  To  clone  two  most  abundant  IncRNA  isoforms  into  pcDNA  vector  (month  6-9) 

Progress  (100%  completed) 

Two  most  abundant  isoforms  of  PCAT29,  SCHLAPl  and  PC  ATS  1  were  amplified  by  PCR. 
These  isoforms  were  then  PCR  amplified  and  cloned  into  lentiviral  vector  (pcDH). 

Task  6:  To  generate  stable  cell  lines  (RWEP)  over-expressing  IncRNAs  by  lentiviral 
transduction  (Month  6-12) 

Progress  (100%  completed) 

Viral  particles  were  made  following  generation  of  stable  cell  lines  expression  individual 
IncRNAs.  These  cells  lines  were  tested  for  gene  overexpression. 

Task  7:  To  perform  in-vitro  proliferation,  migration  and  invasion  assays  using  stable  cell  lines 
(Month  9-12) 

Progress  (100%  completed) 

We  performed  in-vitro  in-vitro  proliferation,  migration  and  invasion  assays  on  PCAT29  KD  and 
overexpression  cells  (Figure  4).  As  shown  below,  KD  of  PCAT29  increased  cell  proliferation 
and  migration  and  overexpression  of  PCAT29  decreased  cell  proliferation  and  migration 
suggesting  a  tumor  suppressor  role  for  PCAT29. 


Figure  4:  Proliferation  and  migration  of  LNCaP  eells  stably  expressing  PCAT29  shRNA  and  DU145  eells 
expressing  PCAT29  expression  eonstmets.  Representative  mierographs  of  erystal  violet-stained  migrated 
cells  are  shown  as  insets.  Data,  mean  ±  SEM.  *,  P  <  0.05  by  the  Student  t  test. 


Similar  to  PCAT29,  we  performed  proliferation  and  apoptosis  assay  on  PC  ATS  1  KD  cells.  As 
shown  below  PC  ATS  1  KD  using  siRNA  reduced  cell  proliferation  and  induced  apoptosis 
suggesting  an  oncogenic  role  for  PCATSl.  Results  for  SCHLAP  are  now  published  (see 
appendix) 
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Figure  5:  Proliferation  of  MDA_PCa_2B,  LNCaP  and  DU145  cells  transfected  with  siRNA  targeting 
PC  ATS  1 .  Effect  on  apoptosis  was  determined  by  immunoblotting  for  cleaved  PARP 

Task  8:  To  perform  in-vivo  tumor  formation  and  metastasis  assay  using  stable  eell  lines  in 
eollaboration  with  Dr.  Felix  Feng  (Month  12-18). 

Progress  (100%  completed) 

We  performed  in-vivo  tumor  formation  assays  with  PCAT29-KD,  SCHLAP-KD  and  PCAT51 
KD  cell  lines.  Results  for  PCAT29  and  SCHLAP  are  now  published  (see  appendix).  As  shown 
below  PCAT5 1  KD  using  shRNA  reduced  tumor  growth  in-vivo. 
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Figure  6:  Growth  of  LNCaP-AR  xenografts  stably  expressing  control  shRNA  or  shRNA  targeting 
PCAT5 1 .  Data,  mean  ±  SEM.  *,  P  <  0.05  by  the  one  way  ANOVA. 
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Task  9:  To  explore  the  role  of  androgen  signaling  in  regulation  of  IncRNAs  (Month  12-18) 


To  address  the  role  of  AR  in  IncRNA  regulation,  we  performed  an  unbiased  discovery  of 
androgen  receptor  regulated  long  non-coding  RNA.  Transcriptome  sequencing  (RNA-seq)  on 
DHT  stimulated  AR  dependent  cell  lines  VCaP  and  LNCaP  was  performed.  The  figure  below 
describes  the  outline  of  the  experiment  performed. 


RNA-Sequencing 
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Figure  7:  Schematic  representation  of  experimental  outline.  B.  Heatmap  of  gene  expression  changes  after 
DHT  stimulation  in  LNCaP  and  VCaP  cells.  C.  Validation  of  50  coding  and  non-coding  transcripts  by  q- 

RT-PCR. 

Specific  Aim  2:  To  elucidate  the  function  of  IncRNA  in  metastatic  castrate  resistant 
prostate  cancer:  (Month  6-24) 

Task  1:  Microarray  analysis  (Month  6-12) 

1 .  Perform  microarray  after  IncRNA  knockdown  using  siRNA. 

2.  Training  with  bioinformaticians  on  analysis  of  microarray  data 

Progress  (100%  completed) 

We  performed  Microarray  after  KD  of  PCAT29  (Figure-8)  and  PCAT51  (Figure  -9).  The  data 
was  analyzed  and  Gene-Set  enrichment  analysis  was  performed  to  find  relevant  concepts. 
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Figure  8:  GSEA  analysis  of  Micorarray  data  obtained  after  PCAT29  knoekdown  in  VCaP  and  LNCaP 

eells.  Bars  represent  the  Geo  eategories  enriehed. 
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Figure  9:  GSEA  analysis  of  Mieorarray  data  obtained  after  PCAT51  knoekdown  in  MDA_PCa_2B  eells. 

Bars  represent  the  Geo  eategories  enriehed. 
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Training:  I  did  training  with  bioinformatician  in  lab  and  learned  how  to  analyze  microarray  data. 
Learned  to  use  LIMMA  package  in  R.  Other  software’s  such  as  MeV4  and  Treeview  were  also 
learned  for  heat  map  generation.  Further,  I  am  taking  courses  on  “R”  with  University  of 
Michigan  biostatistics  department. 

Task  2:  To  make  tilling  probes  for  IncRNAs.  (Month  9-12) 

Progress  (100%  completed) 

We  have  published  ChIRP  technique  with  SCHLAP  (see  appendix).  For  PCAT51  we  made  10 
tiling  arrays.  By  performing  ChIRP  we  have  determined  that  probe  1  and  2  are  the  best.  These 
probes  will  be  use  in  subsequent  experiments. 


■  Fcitner  1-2 

■  Pritner  3-4 

■  Prinier  5-6 

■  Primer  7-8 

■  AR-3*UTR 


Figure  10:  PCAT51  was  pull-down  using  5  set  of  2  probes  each.  Enrichment  of  PCAT51  compared  to 

input  was  determined  by  q-RT-PCR 


Task  3:  To  standardize  ChiRP  protocol.  (Month  9-12) 

Progress  (100%  completed) 

ChiRP  protocol  has  now  been  standardized.  We  published  this  in  SCHLAP  manuscript  (see 
appendix) 


Task  4:  To  perform  ChiRP  and  Mass  spectrometry  analysis  (Month  12-18). 

Progress  (100%  completed) 

After  few  rounds  of  optimization  we  have  decided  to  approach  this  problem  using  a  different 
technique.  In  order  to  identify  RNA  interacting  proteins  we  are  now  performing  in-vitro  RNA 
pull-downs  following  mass-spectrometry.  We  used  PC  ATS  1  as  our  candidate  for  this  approach. 
Briefly,  RNA  was  synthesized  in-vitro  and  labeled  with  BrU.  Labelled  RNA  was  then  used  to 
pulldown  protein.  Mass-spectrometry  was  used  to  identify  the  proteins  bound  to  RNA. 
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Figure  1 1 :  Schematic  representation  of  the  BrU  pulldown  approach.  List  of  protein  in  nucleus  and 
cytoplasm  that  were  found  to  be  associate  with  PCAT51. 


Task  3:  Correlation  analysis  (Month  12-18):  Correlation  data  for  PCAT29  and  SCHLAP  are 
puiblished.  A  similar  analysis  was  performed  for  PCAT51.  Correlated  genes  were  then  analysed 
by  GSEA  and  oncomine.  As  shown  below  positively  correlated  genes  were  significantly 
enriched  in  prostate  cancer  concepts  suggesting  role  of  PCAT5 1  in  cancer  progression. 
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Task  4:  Validation  of  microarray  and  ChiRP  results  (Month  18-24). 
Progress  (0%  completed) 

In  progress 
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Specific  Aim  3:  To  identify  IncRNAs  that  serve  as  potential  biomarkers  of  disease 
progression.  (Month  12-24) 

Task  1:  Screening  of  IncRNA  in  prostate  cancer  cohorts  (Month  12-24):  We  have  screened 
PCAT29  and  SCHLAP  in  cohort  generated  using  samples  from  University  of  Michigan.  Other 
IncRNA  candidates  will  be  screened  in  this  cohort.  Other  independent  prostate  cancer  cohorts 
will  also  be  screened  for  IncRNA  expression. 

Progress  (100%  completed) 

Expression  of  PCAT51  was  assessed  in  RNA-seq  data  obtained  from  University  of  Michigan  as 
well  as  other  sources  such  as  TGCA  and  other  prostate  cancer  studies.  As  shown  below  PCAT51 
was  significantly  expressed  in  prostate  cancer  compared  to  benign  samples. 


300-1 


Normal/benign 
Localized  tumor 
Metastatic  tumor 


Figure  11:  Expression  of  PCAT51  in  various  prostate  datasets. 


Task  2:  Screening  of  IncRNAs  in  FFPE  cohort  (Month  12-24). 

Progress  (0%  completed) 

Task  3:  Fresh  Frozen  prostate  tissue  cohort:  Samples  from  Rapid  Autopsy  program  will  be 
obtained  in  collaboration  with  Dr.  Pienta  and  IncRNA  expression  will  be  measured  followed  by 
analysis  similar  to  FFPE  cohort.  (Month  1-24). 

Progress  (0%  completed) 

Task  4:  Microarray  analysis  using  data  from  Genome  DX.  (Month  18-24) 

Progress  (0%  completed) 
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What  opportunities  for  training  and  professional  development  has  the  project  provided? 

Several  opportunities  for  training  and  professional  development  were  provided  while  working 
on  this  projeet. 

One-on-one  meeting  with  mentor:  Data  was  discussed  with  mentor  (Dr.  Arul  Chinnaiyan)  in 
biweekly  individual  meetings  and  via  monthly  progress  reports.  Constant  inputs  and  advice 
were  provided. 

Presentations  at  Scientific  Conference:  Data  obtained  was  presented  at  several  scientific 
conferences: 

1 .  Annual  AACR  conference. 

2.  Keystone  Symposia  Conference.  “Long  Noncoding  RNAs:  Marching  toward 
Mechanism”  2014 

3.  Sixth  Annual  Prostate  Cancer  Program  Retreat 

Manuscript  writing:  A  manuscript  describing  the  results  was  published  earlier  this  year. 

Weekly  lab  meetings:  All  important  research  findings  were  presented  during  weekly  lab 
meetings  in  front  of  the  group. 


How  were  the  results  disseminated  to  communities  of  interest? 

Results  from  the  grant  were  published  in  form  on  a  manuscript  as  well  as  in  form  of  conference 
proceeding,  (see  appendix) 

What  do  you  plan  to  do  during  the  next  reporting  period  to  accomplish  the  goals? 

By  next  reporting  period  I  anticipate  to  accomplish  all  the  aims  proposed  in  the  study.  We  also 
anticipate  to  have  published  another  manuscript  describing  results  on  IncRNA  PCAT51. 

3.  IMPACT: 

What  was  the  impact  on  the  development  of  the  principal  discipline(s)  of  the  project? 

Once  completed,  the  proposed  project  “Biological  and  clinical  translation  of  novel  long  non¬ 
coding  RNAs  (IncRNAs)  associated  with  metastatic  castration-resistant  prostate  cancer” 

will  have  substantial  impact  on  understanding  prostate  cancer  biology  and  its  clinical 
management.  We  have  proposed  to  study  the  role  of  long  non-coding  RNAs  (IncRNAs)  in 
development  and  progression  of  prostate  cancer  as  well  as  explore  the  potential  of  IncRNA  to 
serve  as  biomarkers  to  predict  disease  progression. 

Till  data  we  have  discovered  several  new  IncRNA  molecules  that  play  important  role  in  prostate 
cancer  progression.  One  such  IncRNA,  PCAT29,  was  shown  to  be  the  first  androgen  receptor- 
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repressed  IncRNA  that  funetions  as  a  tumor  suppressor.  We  also  show  that  loss  of  PCAT29  may 
identify  a  subset  of  patients  at  higher  risk  for  disease  recurrenee.  PCAT29  based  biomarkers  ean 
be  useful  in  prostate  eaneer  management  in  the  clinies. 

We  anticipate  that  at  the  end  of  this  proposed  project,  we  will  be  able  to  discover  several  such 
IncRNAs  that  can  predict  disease  progression  and  will  be  able  to  stratify  patient  into  high  risk  vs 
low  risk  disease. 

What  was  the  impact  on  other  disciplines? 

Nothing  to  report 

What  was  the  impact  on  technology  transfer? 

Nothing  to  report 

What  was  the  impact  on  society  beyond  science  and  technology? 

Nothing  to  report 


4.  CHANGES/PROBLEMS:. 

Changes  in  approach  and  reasons  for  change 

To  identify  protein  that  binds  to  IncRNA  we  initially  proposed  to  perform  ChlRP 
following  mass  spectrometry.  However,  due  to  incompatibility  of  ChIRP  ragents  with 
mass-spec,  we  utilized  a  slightly  different  approach.  We  performed  pull-down  using  in- 
vitro  labeled  RNA.  Interacting  proteins  were  then  identified  by  mass-spectrometry. 

Actual  or  anticipated  problems  or  delays  and  actions  or  plans  to  resolve  them 

None 

Changes  that  had  a  significant  impact  on  expenditures 

None 

Significant  changes  in  use  or  care  of  human  subjects,  vertebrate  animals,  biohazards, 
and/or  select  agents 

No  Changes 

Institutional  Animal  Care  and  Use  Approval  Number  and  Dates: 
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PR000004337,  12/12/2012-  12/12/2015  (Mouse) 

Significant  changes  in  use  or  care  of  human  subjects: 

None 

Significant  changes  in  use  or  care  of  vertebrate  animals. 

None 

Significant  changes  in  use  of  biohazards  and/or  select  agents. 

None 


5.  PRODUCTS: 

Publications,  conference  papers,  and  presentations 
Journal  publications: 

Malik  R,  Patel  L,  Prensner  JR,  Shi  Y,  Iyer  M,  Subramaniyan  S,  Carley  A,  Niknafs  YS, 
Sahu  A,  Han  S,  Ma  T,  Liu  M,  Asangani  lA,  Jing  X,  Cao  X,  Dhaneshekaran  SM, 
Robinson  D,  Feng  FY,  Chinnaiyan  AM.  The  IncRNA  PCAT29  Inhibits  Oncogenic 
Phenotypes  in  Prostate  Cancer.  Mol  Cancer  Res.  2014  Aug;12(8):  1081-7.  doi: 
10.1158/1541-7786.MCR-14-0257.  PMCID:  PMC4135019. 

Acknowledgement  of  federal  support:  Yes. 

Prensner,  J.R.*,  A.  Sahu*,  M.K.  Iyer*,  Malik  R.*,  B.  Chandler,  LA.  Asangani,  A. 
Poliakov,  I.A.  Vergara,  M.  Alshalalfa,  R.B.  Jenkins,  E.  Davicioni,  F.Y.  Feng,  and  A.M. 
Chinnaiyan.  2014b.  The  IncRNAs  PCGEMl  and  PRNCRl  are  not  implicated  in 
castration  resistant  prostate  cancer.  Oncotarget.  (*Co-first  Author) 

Acknowledgement  of  federal  support:  Yes. 


Other  publications,  conference  papers,  and  presentations. 


Rohit  Malik,  Matthew  K.  Iyer,  John  R.  Prensner,  Lalit  Patel,  Sumin  Han,  Wei  Chen, 
Felix  Feng,  Arul  M.  Chinnaiyan.  Identification  and  characterization  of  a  novel  androgen- 
regulated  long  non-coding  RNA  in  prostate  cancer,  [abstract].  In:  Proceedings  of  the 
104th  Annual  Meeting  of  the  American  Association  for  Cancer  Research;  2013  Apr  6-10; 
Washington,  DC.  Philadelphia  (PA):  AACR;  Cancer  Res  2013;73(8  Suppl): Abstract  nr 
1120.  doi:10.1158/1538-7445.AM2013-l  120  (*) 
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Rohit  Malik,  Matthew  K  Iyer,  Shruthi  Subramaniam,  Yasuyuki  Hosono,  Anirban  Sahu, 
Xia  Jiang,  Yang  Shi,  Vishal  Kothari,  Xuhong  Cao,  Dan  Robinson,  Saravana  M. 
Dhanasekaran,  Felix  Y  Feng  and  Ami  M  Chinnaiyan.  Keystone  Symposia  Conference. 
“Long  Noncoding  RNAs:  Marching  toward  Mechanism”  2014 

Rohit  Malik,  Matthew  K.  Iyer,  John  R.  Prensner,  Lalit  Patel,  Sumin  Han,  Wei  Chen, 
Felix  Feng,  Ami  M.  Chinnaiyan.  “Sixth  Annual  Prostate  Cancer  Program  Retreat”. 
March  18-20,  2013(*) 

Technologies  or  techniques 

Nothing  to  Report. 

Inventions,  patent  applications,  and/or  licenses 

Nothing  to  Report. 

Other  Products 

Nothing  to  Report. 


6.  PARTICIPANTS  &  OTHER  COLLABORATING  ORGANIZATIONS 
What  individuals  have  worked  on  the  project? 


Name: 


Rohit  Malik  -  No  change 


Has  there  been  a  change  in  the  active  other  support  of  the  PD/PI(s)  or  senior/key  personnel 
since  the  last  reporting  period? 

New  Award: 

AwardNp:  N/A  (PI:  Malik)  01/20/2014-01/20/2017  0  cal  months  effort 

Sponsor:  Prostate  Cancer  Foundation  Young  Investigator  Award 

Title:  Characterization  and  therapeutic  targeting  of  androgen  receptor  co-activators  in 
castration  resistance  prostate  cancer 

Goals:  The  goals  of  this  project  are  three  fold.  1)  To  develop  highly  potent  small 
molecule  inhibitors  of  the  menin-MLL  interaction  with  optimized  dmg-like  properties.  2) 
To  study  the  mechanistic  effects  of  pharmacologic  menin-MLL  inhibition  on  AR 
regulated  signaling  and  tumor  growth.  3)  To  establish  the  efficacy  of  menin-MLL 
inhibitors  in  vivo  employing  CRPC  pre-clinical  models. 

Overlap:  None 
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Role:  PI,  Young  Investigator 


7.  SPECIAL  REPORTING  REQUIREMENTS 
o  None 


8.  APPENDICES:  Attach  all  appendices  that  contain  information  that  supplements, 
clarifies  or  supports  the  text.  Examples  include  original  copies  of  journal  articles, 
reprints  of  manuscripts  and  abstracts,  a  curriculum  vitae,  patent  applications,  study 
questionnaires,  and  surveys,  etc. 

1.  Malik  et.  al.,MCR  2014 

2.  Prensner  et.  al.,  Nat  Gen  2013 

3.  CV_Rohit  Malik 
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Abstract 

Long  noncoding  RNAs  (IncRNA)  have  recendy  been  associated  with  the  development  and  progression  of  a 
variety  of  human  cancers.  However,  to  date,  the  interplay  between  known  oncogenic  or  tumor-suppressive  events 
and  IncRNAs  has  not  been  well  described.  Here,  the  novel  IncRNA,  prostate  cancer-associated  transcript  29 
{PCAT29),  is  characterized  along  with  its  relationship  to  the  androgen  receptor.  PCAT29  is  suppressed  by  DHT 
and  up  regulated  upon  castration  therapy  in  a  prostate  cancer  xenograft  model.  PCA  T29  knockdown  significantly 
increased  proliferation  and  migration  of  prostate  cancer  cells,  whereas  PCAT29  overexpression  conferred  the 
opposite  effect  and  suppressed  growth  and  metastases  of  prostate  tumors  in  chick  chorioallantoic  membrane  assays. 
Finally,  in  prostate  cancer  patient  specimens,  low  PCAT29  expression  correlated  with  poor  prognostic  outcomes. 
Taken  together,  these  data  expose  PCAT29  as  an  androgen-regulated  tumor  suppressor  in  prostate  cancer. 

Implications:  This  study  identifies  PCAT29  as  the  first  androgen  receptor-repressed  IncRNA  that  functions  as  a 
tumor  suppressor  and  that  its  loss  may  identify  a  subset  of  patients  at  higher  risk  for  disease  recurrence. 

Visual  Overview:  http://mcr.aacrjournals.Org/content/early/20l4/07/31/l 541-7786.MCR-14-0257/F1  .large.jpg. 
Mol  Cancer  Res;  12(8);  1081-7.  ©2014  AACR. 


Introduction 

Recently,  data  from  the  ENCODE  project  have  revealed 
that  the  majority  of  the  transcrip  tome  is  composed  of 
noncoding  RNAs  (1).  While  the  classification  of  these 
noncoding  RNAs  is  still  in  development,  long  noncoding 
RNAs  (IncRNA)  are  of  particular  interest,  given  the  similar 
features  they  share  with  protein-coding  genes  as  well  as 
recent  evidence  of  their  roles  in  cancer  biology  (2,  3). 
LncRNAs  are  polyadenylated  RNA  species  that  are  more 
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than  200  bp  in  length,  transcribed  by  RNA  polymerase  II, 
and  associated  with  common  epigenetic  signatures  such  as  of 
histone  3  lysine  4  trimethylation  (H3K4me3)  at  the  tran¬ 
scriptional  start  site  (TSS)  and  histone  3  lysine  36  trimethy¬ 
lation  (H3K36me3)  in  the  gene  body  (4).  Several  IncRNAs 
have  been  shown  to  play  a  role  in  biologic  processes  such  as 
X-chromosomal  inactivation,  pluripotency  (5),  and  gene 
regulation  (6).  Recently,  several  IncRNAs  have  been  impli¬ 
cated  in  cancer  initiation  and  progression  (3,  7).  Apart  from 
their  role  in  tumor  initiation  and  progression,  IncRNAs  have 
been  shown  to  be  promising  biomarkers.  In  prostate  cancer, 
PCA3  is  a  well-studied  prostate  cancer  biomarker  that  is  now 
available  for  clinical  use  as  a  urine  biomarker  assay  for 
diagnosis  of  prostate  cancer  (8,  9). 

Despite  their  involvement  in  various  cellular  processes,  the 
majority  of  IncRNAs  are  uncharacterized,  and  their  role  in 
cancer  initiation  and  progression  remains  unclear.  Using 
transcriptome  sequencing,  our  group  recently  identified 
more  than  1 00  IncRNAs,  named  prostate  cancer-associated 
transcripts  {PCATs),  which  are  differentially  expressed  or 
have  outlier  profiles  in  prostate  cancer  versus  normal  tissue 
(3) .  Here  we  find  that  one  of  these  novel  IncRNAs,  PCA  T29, 
exhibits  cancer-suppressive  phenotypes,  including  inhibi¬ 
tion  of  cell  proliferation,  migration,  tumor  growth,  and 
metastases.  In  accordance  with  this,  PCAT29  is  repressed 
by  androgen  signaling,  and  low  PCA  T29  expression  associ¬ 
ates  with  worse  clinical  outcomes. 


www.aacrjournals.org 
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Materials  and  Methods 

Cell  lines  and  reagents 

Prostate  cancer  cells  were  cultured  as  follows:  VCaP  cells 
in  DMEM  with  GlutaMAX  (Invitrogen)  and  LNCaP  and 
DU  145  cells  in  RPMI-1640  (Invitrogen)  in  a  5%  C02  cell 
culture  incubator.  All  the  media  were  supplemented  with 
10%  FBS  (Invitrogen)  and  1%  penicillin-streptomycin 
(Invitrogen).  All  cell  lines  were  purchased  from  ATCC  and 
were  authenticated. 

For  stable  knockdown  of  PCAT29,  LNCaP  and  VCaP 
cells  were  transfected  with  lentiviral  constructs  encoding  2 
different  PCAT29  shRNAs  or  nontargeting  shRNAs  in  the 
presence  of  polybrene  (8  jIg/mL;  Supplementary  Table 
SI  A).  After  48  hours,  transduced  cells  were  grown  in  culture 
media  containing  3  to  5  |Ig/mL  puromycin.  For  PCAT29 
overexpression,  2  isoforms  of  PCAT29  were  generated  by 
subcloning  the  PCR  product  into  the  CPOl  sites  of  the 
pCDH-CMV  vector  (System  Biosciences).  Five  hundred 
base  pairs  of  the  genomic  region  was  attached  at  the  5'  end  of 
each  isoform.  Lentiviral  particles  were  made  and  DU145 
cells  were  transduced  as  described  above. 

Gene  expression  by  quantitative  PCR 

Total  RNA  was  isolated  using  TRIzol  (Invitrogen)  and  an 
RNeasy  kit  (Qiagen)  according  to  manufacturers'  instruc¬ 
tion.  Total  RNA  was  reverse  transcribed  into  cDNA  using 
Superscript  III  and  random  primers  (Invitrogen).  Quanti¬ 
tative  PCR  (qPCR)  was  performed  using  SYBR  Creen 
Master  Mix  (Applied  Biosystems)  on  an  Applied  Biosystems 
7900HT  Real-Time  System.  The  relative  quantity  of  the 
target  gene  was  computed  for  each  sample  using  the  AAQ 
method  by  comparing  mean  Q  of  the  gene  to  the  mean  Q  of 
the  housekeeping  gene  GAPDH.  All  the  primers  were 
obtained  from  Integrated  DNA  Technologies  (IDT). 
Sequences  of  all  the  primers  used  are  listed  in  Supplementary 
Table  SIB. 

Rapid  amplification  of  cDNA  ends) 

5'  and  3^  RACE  was  performed  using  the  CeneRacer 
RLM-RACE  kit  (Invitrogen)  following  manufacturer's 
instruction.  RACE  PCR  products  were  separated  on  a 
1%  agarose  gel.  Individual  bands  were  gel  purified,  cloned 
in  pcr4-TOPO  vector,  and  sequenced  using  M13  primers. 

Expression  of  PCAT29  after  castration  in  prostate  tumor 
xenograft  model 

Five-week-old  male  nude  athymic  BALB/c  nuinu  mice 
(Charles  River  Laboratory)  were  used  for  xenograft  stud¬ 
ies.  LNCaP  cells  were  resuspended  in  100  jiL  of  PBS  with 
20%  Matrigel  (BD  Biosciences)  and  implanted  subcuta¬ 
neously  into  the  left  flank  regions  of  the  mice.  Mice  were 
castrated  and  euthanized  5  days  after  castration.  RNA  was 
extracted  from  the  xenografts  and  expression  of  PCA  T29 
and  FKBP5  was  measured.  All  experimental  procedures 
involving  mice  were  approved  by  the  University  Com¬ 
mittee  on  Use  and  Care  of  Animals  at  the  University  of 
Michigan  (Ann  Arbor,  MI)  and  conform  to  their  relevant 
regulatory  standards. 


Chromatin  immunoprecipitation 

Chromatin  immunoprecipitation  (ChIP)  was  performed 
with  polyclonal  androgen  receptor  antibody  (Millipore 
PC21)  using  HiCell  ChIP  kit  (Diagenode)  following  man¬ 
ufacturer's  instruction.  Briefly,  cells  were  treated  with  10 
jImol/L  MDV3100  or  10  jImol/L  bicalutamide  16  hours 
before  the  treatment  with  10  nmol/L  DHT  for  12  hours. 
Approximately  1  million  cells  were  cross-linked  per  antibody 
with  1%  formaldehyde.  Chromatin  was  sonicated  to  an 
average  length  of  500  bp  and  centrifuged  to  remove  debris. 
Magnetic  protein-G  beads  were  coated  with  6  jig  of  antibody 
and  incubated  with  chromatin  overnight  at  4°C.  Protein- 
chromatin-antibody  complexes  were  washed  thrice  and 
cross-linking  was  reversed.  ChIP  products  were  cleaned 
using  IPure  kit  (Diagenode).  Eluted  DNA  was  quantified 
by  RT-PCR  using  primers  described  in  Supplementary 
Table  SIB. 

Cell  proliferation  and  migration  assay 

LNCaP  and  DU145  cells  stably  expressing  PCAT29 
shRNA-1  and  2  or  PCAT29  isoform  1  and  2  were  seeded 
in  24-well  plates.  Cells  were  trypsinized  and  counted  by 
using  Coulter  Counter  (Beckman  Coulter)  at  the  indicated 
time  points  in  triplicate.  For  migration  assays,  approximately 
1x10^  cells  were  seeded  in  the  upper  chamber  of  a  Boyden 
chamber.  About  500  jlL  of  complete  medium  (10%  FBS) 
was  added  to  the  lower  chamber  as  a  chemoattractant.  Forty- 
eight  hours  after  seeding,  cells  on  the  upper  surface  were 
removed  using  a  cotton  swab.  Inserts  were  fixed  with  3.7% 
formaldehyde  and  migrated  cells  on  the  lower  surface  of  the 
membrane  were  stained  with  crystal  violet.  The  inserts  were 
treated  with  10%  acetic  acid,  and  absorbance  was  measured 
at  560  nm. 

Gene  expression  microarray 

Expression  profiling  of  VCaP  and  LNCaP  cells  after 
PCAT29  knockdown  was  performed  using  the  Agilent 
Whole  Human  Genome  Oligo  Microarray  as  described 
(7).  GEO  accession  number:  GSE58397. 

Chicken  chorioallantoic  membrane  assay 

22RV 1  cells  were  transduced  with  empty  vector  (pcDH) 
or  PCAT29-iso^or:mA.  A  total  of  10^  cells  were  inoculated 
on  the  chicken  chorioallantoic  membrane  (CAM)  assay  as 
described  previously  (10).  For  tumor  growth  and  metastasis, 
the  eggs  were  incubated  for  1 8  days  in  total,  after  which  the 
extra-embryonic  tumor  were  exercised  and  weighed,  and  the 
embryonic  livers  were  harvested  and  analyzed  for  the  pres¬ 
ence  of  tumor  cells  by  quantitative  human  Alu-specific  PCR. 
Quantification  of  human  cells  in  the  extracted  DNA  was 
performed  as  described  (11).  Fluorogenic  TaqMan  qPCR 
probes  were  applied  as  above  and  DNA  copy  numbers  were 
quantified. 

Kaplan-Meier  analysis  of  PCAT29 

For  outcomes  analysis,  PCAT29  expression  was  deter¬ 
mined  on  a  cohort  of  51  radical  prostatectomy  specimens 
from  patients  with  prostate  cancer  at  the  University  of 
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Figure  1.  Characterization  of  PCAT29.  A,  genome  browser  representation  of  PCAT29.  Current  gene  annotations  from  Ensembi,  ENCODE,  UCSC  genes  and 
incRNA  databases  are  shown.  ChiP-Seq  data  for  H3K4me3  and  DNasei  HS  signai  in  LNCaP  ceiis  obtained  from  ENCODE.  B,  schematic  representation 
of  PCAT29  isoforms  as  determined  by  RACE  anaiysis.  C,  nuciear  and  cytopiasmic  distribution  of  various  noncoding  and  protein-coding  transcripts 
in  LNCaP  and  VCaP  ceiis.  Error  bar,  ±SEM. 


Michigan  with  long-term  biochemical  recurrence  outcomes. 
Biochemical  recurrence  was  defined  by  an  increase  of  PSA  of 
0.2  ng/mL  over  the  PSA  nadir  following  prostatectomy. 
PCAT29  expression  was  determined  by  a  SYBR-Green 
qPCR  assay  using  the  average  of  GAPDH  +  HMBS  for 
data  normalization  using  the  AAQ  method.  Expression  data 
were  transformed  using  a  2;-score  and  patients  were  defined  as 
high  (top  33%  of  patients)  or  low  (bottom  66%  of  patients) 
for  PCAT29  expression.  Kaplan— Meier  curves  for  biochem¬ 
ical  recurrence-free  survival  were  generated  for  PCAT29- 
high  and  PCA  T29-\ow  patients  using  the  GraphPad  Prism 
program.  Statistical  significance  was  determined  with  a  log- 
rank  test. 

Results 

PCAT29  is  a  novel  long  nuclear  noncoding  RNA 

Using  RNA-Seq  data  from  prostate  cancer  tissues,  we 
previously  identified  121  IncRNAs,  named  PGATs,  which 
demonstrate  differential  expression  or  outlier  profiles  in 
prostate  cancer  compared  with  normal  tissue  (3).  Here  we 
characterize  and  functionally  investigate  one  of  the  top  outlier 
IncRNAs,  PCAT29  (Ensembi  ID  ENSG00000259641). 
Using  the  predicted  transcript  structures,  we  designed  exon 
spanning  primers  and  performed  rapid  amplification  of 
cDNA  ends  (RAGE)  to  determine  the  full  exon  structure. 
As  shown  in  a  genome  browser  view,  PCAT29  is  a  694-bp 
polyadenylated  transcript  present  on  chrl5(q23),  and  the 
PCAT29  gene  spans  over  a  10-kb  stretch  (Eig.  lA;  Supple¬ 


mentary  Eig.  SI  A).  PCAT29  is  composed  of  6  exons  that 
are  alternatively  spliced  to  produce  multiple  isoforms  (Eig. 
IB).  To  further  characterize  PCAT29,  we  interrogated 
recently  published  ENGODE  data  for  H3K4  trimethylation 
(H3K4me3)  and  DNasei  hypersensitive  sites  (DNaseH), 
marks  that  predicts  for  open  chromatin  state  and  are  com¬ 
monly  found  near  or  at  the  TSSs,  generated  in  the  prostate 
cancer  cell  line  LNGaP  (4).  We  found  several  DNaseH  and 
H3K4  trimethylation  peaks  at  the  TSS  of  PCAT29, 
suggesting  that  PCAT29  is  an  actively  transcribed  gene 
(Eig.  lA). 

To  confirm  that  PGAT29  is  indeed  a  noncoding  RNA,  we 
assessed  the  protein-coding  potential  of  PCA  T29  using  the 
coding  potential  calculator  (GPG)  algorithm,  which  discri¬ 
minates  coding  genes  (positive  score)  from  noncoding  tran¬ 
scripts  (negative  score;  ref.  12).  TG4Z2i^had  a  GPG  score  of 
—0.8921,  whereas  protein-coding  genes  such  as  TP53  and 
^-actin  scored  +8.25  and  +3.70,  respectively  (Supplemen¬ 
tary  Fig.  SIB).  Gonsistent  with  this  finding,  we  found  that  in 
both  LNGaP  and  VGaP  cells,  expression  of  PCAT29  was 
limited  to  nucleus,  whereas  other  protein-coding  mRNAs, 
such  as  GAPDH  and  ^-actin,  were  expressed  in  cytoplasm 
(Fig.  IG).  We  then  verified  the  expression  of  PCAT29  in 
various  prostate  cancer  cell  lines  (LNGaP,  VGaP,  22RV1, 
DU  145,  PG3)  and  immortalized  or  primary  prostate  epi¬ 
thelial  cells  (RPWE  and  PrEG).  PCAT29  expression  was 
highest  in  androgen  receptor-dependent  cell  lines  such  as 
LNGaP,  VGaP,  and  22RV1  (Supplementary  Fig.  SIG). 
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Next,  we  assessed  the  expression  of  PCAT29  in  various 
tissues  using  transcriptome  sequencing  data.  PCAT29 
expression,  although  not  limited  to  prostate,  was  enriched 
in  prostate  samples  compared  with  other  tissues  (Supple¬ 
mentary  Fig.  SID). 

Androgen  receptor  binds  to  the  PCAT29  promoter  and 
regulates  PCAT29  expression 

We  next  examined  the  effect  of  androgen  receptor  signaling 
on  PCAT29  in  LNCaP  cells  stimulated  with  10  nmol/L 
DHT.  As  shown  in  Fig.  2A,  PCAT29  expression  was  sup¬ 
pressed  upon  stimulation  with  DHT  in  a  time-dependent 
fashion  both  in  LNCaP  and  VCaP  cells.  In  contrast,  expres¬ 
sion  of  canonical  androgen  receptor  target  genes,  such  as 
FKBP5  and  KLK3,  was  increased  upon  stimulation  (Supple¬ 
mentary  Fig.  S2A).  To  examine  whether  the  suppression 
of  PCAT29  was  androgen  receptor-specific,  LNCaP  cells 


were  pretreated  with  the  androgen  receptor  antagonists 
MDV3100  or  bicalutamide  before  treatment  with  DHT. 
As  expected,  DHT  stimulation  suppressed  the  expression  of 
PCA  T29,  and  pretreatment  with  MDV3 1 00  or  bicalutamide 
rescued  this  suppression.  Similarly,  expression  o^PCAT29  'vs\ 
LNCaP  cells  grown  in  charcoal-stripped  media  as  well  as  in  an 
androgen  receptor-independent  variant  of  LNCaP  cells 
(C42)  was  higher  than  in  cells  grown  in  serum-containing 
media  and  LNCaP  cells,  respectively  (Supplementary  Fig. 
S2B  and  S2C).  We  next  investigated  whether  androgen 
receptor  suppresses  the  expression  of  PCAT29  in  vivo. 
LNCaP  xenografts  were  established  in  mice  followed  by 
physical  castration  to  ablate  androgen  receptor  signaling.  As 
expected,  5  days  of  castration  led  to  significant  increase  in  the 
expression  of  PCAT29  in  tumors  (Fig.  2C).  In  contrast, 
expression  of  FKBP5  was  reduced  in  tumors  from  castrated 
mice.  Taken  together,  our  results  suggest  that  stimulation 
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Figure  2.  Androgen  receptor  binds  to  the  promoter  of  PCAT29  and  suppresses  its  expression.  A,  expression  of  PCAT29  in  LNCaP  and  VCaP  cells  treated 
with  10  nmol/L  DHT  for  indicated  time  points.  B,  expression  of  PCAT29  in  LNCaP  cells  treated  with  10  nmol/L  DHT  in  the  presence  or  absence  of 
10  |imol/L  MDV3100  or  bicalutamide.  C,  expression  of  PCAT29  and  FKBP5  in  LNCaP  xenografts  obtained  from  control  mice  and  mice  that  were  physically 
castrated  for  5  days.  D,  genome  browser  representation  of  androgen  receptor  (AR)  binding  on  the  promoter  of  PCAT29  before  and  after  stimulation 
with  1  nmol/L  R1881 .  Consensus  androgen-responsive  elements  (ARE)  and  ARE  present  in  the  PCAT29  promoter  are  shown.  Inset,  ChIP-PCR  to  confirm 
AR  occupancy  on  TMPRSS2  and  PCAT29  gene  promoter.  They-axis  represents  AR  ChIP  enrichment  in  VCaP  cells  treated  with  10  nmol/L  DHT 
normalized  to  ethanol  (Ethl)-treated  cells.  Bars,  SEM. 
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of  androgen  receptor  leads  to  suppression  of  PCAT29 
expression. 

To  further  study  the  association  of  PCAT29  expression 
with  androgen  signaling,  we  interrogated  published  ChlP- 
Seq  data  (13)  and  found  androgen  receptor-binding  sites  in 
the  promoter  region  of  PCA  T29  (Fig.  2D) .  These  peaks  were 
similar  to  those  observed  in  other  known  androgen  receptor- 
regulated  genes  (Supplementary  Fig.  S2D).  Upon  closer 
inspection,  we  found  a  canonical  androgen  receptor-bind¬ 
ing  site  near  the  PCAT29  TSS  in  a  putative  enhancer  region 
bounder  by  androgen  receptor  (Fig.  2D).  We  confirmed  our 
ChIP-Seq  data  by  performing  ChIP  for  androgen  receptor 
followed  by  PCR  for  the  PCAT29  promoter.  As  shown 
in  Fig.  2D,  stimulation  of  VCaP  cells  with  DHT  led  to  an 
increase  in  association  of  androgen  receptor  with  the 
PCAT29  promoter.  This  association  was  reduced  in  cells 
pretreated  with  bicalutamide  and  MDV3 1 00.  Taken  togeth¬ 
er,  our  data  suggest  that  androgen  receptor  can  directly  bind 
to  the  promoter  of  PCA  T29  and  leads  to  the  suppression  of 
gene  expression. 

PCAT29  regulates  oncogenic  phenotypes  in  vitro  and  in 
vivo 

The  androgen  receptor  drives  oncogenesis  in  treatment- 
naive  prostate  cancer  as  well  as  disease  progression  in 
castration-resistant  prostate  cancers.  Because  androgen 
receptor  binds  to  the  PCA  T29  promoter  and  regulates  gene 
expression,  we  investigated  the  functional  role  of  PCAT29. 
Two  independent  shRNAs  were  designed  to  knockdown  the 
expression  of PCAT29 in  cells  (Supplementary  Fig.  S3A  and 
S3B).  VCaP  and  LNCaP  cells  were  transfected  with 
PCA  T29  shRNAs  following  analysis  using  gene  expression 
microarray.  We  found  GO  concepts  enriched  for  cell  cycle, 
proliferation,  and  migration-related  genes,  suggesting  a  role 
of  PCA  T29  in  proliferation  and  migration  (Supplementary 
Fig.  S3D-S3G).  Next,  we  defined  a  signature  of  genes 
positively  and  negatively  correlated  with  PCA  T29  expression 
from  prostate  cancer  samples  as  described  before  (7).  We 
checked  the  overlap  of  these  genes  with  the  top  1500 
differentially  expressed  genes  in  PCA  T29  knockdown  sam¬ 
ples  of  VGAP  and  LNGAP  cells.  As  expected,  the  positively 
correlated  genes  show  a  significant  overlap  with  genes  down- 
regulated  with  knockdown  of  PCAT29  and  the  negatively 
correlated  genes  show  a  significant  overlap  with  genes 
up  regulated  by  knockdown  of  PCAT29  in  both  VGAP  and 
LNGAP  {P  <  0.001  for  all  pairwise  comparisons  of  over¬ 
lapping  genes.  Supplementary  Fig.  S4A-S4D).  For  over¬ 
lapping  genes,  we  did  see  enrichment  in  pathways  such  as  cell 
cycle,  apoptosis,  and  cell  growth  (Supplementary  Fig.  S4A- 
S4D).  Taken  together,  this  analysis  suggested  a  role  of 
PCAT29  in  cell  proliferation  and  migration. 

To  experimentally  validate  this  observation,  cell  prolifer¬ 
ation  was  assessed  in  LNGaP  cells  transfected  with  control 
versus  PCAT29  shRNAs.  To  our  surprise,  knockdown  of 
PCAT29  in  LNGaP  cells  led  to  an  increase  in  cell  prolifer¬ 
ation  and  migration  (Fig.  3A).  To  further  validate  this 
observation,  we  stably  overexpressed  the  2  most  prevalent 
isoforms  of  PCAT29  in  DU  145  prostate  cancer  cells  using  a 


lentiviral  vector  (Supplementary  Fig.  S3G).  Gonsistent  with 
the  previous  knockdown  studies,  overexpression  of  these  2 
isoforms  of  PCAT29  in  DU  145  led  to  suppression  of  cell 
proliferation  and  migration  (Fig.  3B).  We  next  assessed 
whether  similar  effects  of  PCAT29  could  be  achieved  in 
vivo.  22RV1  prostate  cancer  cells  overexpressing  PCAT29 
(isoform- 1)  were  implanted  on  the  GAM  of  a  chicken  egg. 
Gompared  with  control  cells,  overexpression  of  PCAT29 
significantly  decreased  the  growth  of  tumor  on  the  GAM  as 
well  as  decreased  liver  metastases  (Fig.  3G). 

Finally,  we  measured  the  expression  of  PCAT29  in  an 
independent  cohort  of  5 1  radical  prostatectomy  specimens 
from  patients  with  prostate  cancer  with  localized  disease  and 
clinical  follow-up.  As  shown  in  Kaplan— Meier  analysis 
(Fig.  3D),  patients  with  lower  PCAT29  expression  had 
significantly  higher  rates  of  biochemical  recurrence,  consis¬ 
tent  with  our  in  vitro  and  in  vivo  findings. 

Discussion 

In  this  study,  we  characterize  the  novel  IncRNA  PCA  T29. 
Our  findings  demonstrate  that  PGAT29  is  directly  regulated 
by  the  androgen  receptor,  which  binds  to  the  promoter  of 
PCAT29  and  suppresses  its  transcription.  In  vitro  studies 
show  that  PCAT29  negatively  regulates  prostate  cancer 
proliferation  and  migration,  and  GAM  assays  demonstrate 
that  PCAT29  inhibits  tumor  growth  and  metastases.  Low 
expression  of  PCAT29  is  associated  with  higher  rates  of 
biochemical  recurrence,  suggesting  that  PCAT29  represses 
oncogenic  phenotypes  via  a  tumor-suppressive  role. 

While  previous  studies  have  nominated  and  characterized 
IncRNAs  that  are  dysregulated  in  cancer  (3, 14),  the  majority 
of  characterized  IncRNAs,  to  date,  has  been  associated  with 
oncogenic  roles  instead  of  tumor  suppressor  functions.  In 
fact,  there  have  been  only  a  handful  of  IncRNAs  identified  to 
date  that  function  in  repression  of  cancer  phenotypes,  and,  to 
our  knowledge,  none  of  these  are  targets  downregulated  by 
known  oncogenes  (14).  A  recent  study  identifies  a  protein¬ 
coding  gene,  GGN3/NOV,  as  a  tumor  suppressor  that  is 
repressed  by  androgen  receptor  (15).  Thus,  our  study 
represents  the  first  identification  of  an  androgen  receptor- 
repressed  IncRNA  functioning  as  a  tumor  suppressor.  While 
further  studies  will  be  required  to  identify  the  mechanism  of 
PCAT29  and  other  tumor  suppressor  IncRNAs,  it  is  clear 
that  these  IncRNAs  represent  an  intriguing  area  for  explo¬ 
ration  in  cancer  biology. 

In  the  context  of  prostate  cancer,  androgen-regulated 
IncRNAs  are  of  fundamental  importance,  given  that  all 
stages  of  prostate  cancer  are  exquisitely  dependent  on  andro¬ 
gen  receptor  signaling  for  growth  and  survival.  Because  the 
majority  of  clinically  relevant  prostate  cancer  therapies  target 
the  androgen  receptor,  our  studies  would  suggest  that 
inhibition  of  androgen  signaling  will  result  in  reactivation 
of  PCA  T29,  providing  another  mechanism  underlying  the 
effectiveness  of  androgen  deprivation  therapy. 

Glinically,  there  is  a  clear  need  for  identification  of 
prognostic  biomarkers  in  prostate  cancer  to  help  guide 
decisions  on  treatment  intensification.  The  association  of 
high  PCAT29  expression  with  good  clinical  prognosis  and 
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Figure  3.  PCAT29  suppresses  oncogenic  phenotypes.  A  and  B,  proliferation  and  migration  of  LNCaP  cells  stably  expressing  PCAT29  shRNA  (B)  and 
DU145  cells  expressing  PCAT29  expression  constructs  (C).  Representative  micrographs  of  crystal  violet-stained  migrated  cells  are  shown  as  insets.  C, 
quantification  of  tumor  weight  and  metastasis  to  liver  for  22Rv1  cells  expressing  PCA 7'29-isoform  1  or  empty  vector  (pcDH)  in  the  CAM  assay.  Data, 
mean  ±  SEM.  *,  P<0.05  by  the  Student  f  test.  D,  Kaplan-Meier  analyses  of  prostate  cancer  outcomes.  PCAT29  expression  was  measured  by  qPCRand  51 
patients  were  stratified  according  to  their  PCAT29  expression.  Patient  outcomes  were  analyzed  for  freedom  from  biochemical  recurrence. 


preclinical  suppression  of  cell  proliferation  and  tumor  metas- 
tases  suggests  that  decreased  expression  or  loss  of  PCA  T29 
may  identify  subsets  of  patients  who  may  require  further 
intensification  of  therapy.  As  our  clinical  cohort  was  com¬ 
posed  of  hormone-sensitive  disease  from  patients  with  pros¬ 
tatectomy,  further  studies  need  to  be  performed  to  deter¬ 
mine  whether  PCAT29  can  also  serve  as  a  prognostic 
biomarker  in  the  context  of  more  advanced,  castration- 
resistant  disease.  Regardless,  this  study  highlights  the  impor¬ 
tance  of  IncRNAs  in  prostate  cancer  biology  and  prognosis 
and  suggests  the  need  for  further  research  in  this  relatively 
unexplored  area. 
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The  long  noncoding  RNA  SChLAPl  promotes  aggressive 
prostate  cancer  and  antagonizes  the  SWI/SNF  complex 
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Prostate  cancers  remain  indolent  in  the  majority  of  individuals 
but  behave  aggressively  in  a  minority^'^.The  molecular 
basis  for  this  clinical  heterogeneity  remains  incompletely 
understood^"^.  Here  we  characterize  a  long  noncoding  RNA 
termed  SChLAPl  (second  chromosome  locus  associated  with 
prostate-1;  also  called  LINC00913)  that  is  overexpressed  in 
a  subset  of  prostate  cancers.  SChLAPl  levels  independently 
predict  poor  outcomes^  including  metastasis  and  prostate 
cancer-specific  mortality.  In  vitro  and  in  vivo  gain-of-function 
and  loss-of-function  experiments  indicate  that  SChLAPl 
is  critical  for  cancer  cell  invasiveness  and  metastasis. 
Mechanistically  SChLAPl  antagonizes  the  genome-wide 
localization  and  regulatory  functions  of  the  SWI/SNF 
chromatin-modifying  complex.  These  results  suggest  that 
SChLAPl  contributes  to  the  development  of  lethal  cancer  at 
least  in  part  by  antagonizing  the  tumor-suppressive  functions 
of  the  SWI/SNF  complex. 

With  over  200,000  new  cases  per  year,  prostate  cancer  will  be  diag¬ 
nosed  in  1  in  6  men  in  the  United  States  during  their  lifetime,  yet  only 
20%  of  individuals  with  prostate  cancer  have  a  high-risk  cancer  that 
represents  potentially  lethal  disease Whereas  mutational  events 
in  key  genes  characterize  a  subset  of  lethal  prostate  cancers^’^’^,  the 
molecular  basis  for  aggressive  disease  remains  poorly  understood. 

Long  noncoding  RNAs  (IncRNAs)  are  RNA  species  >200  bp  in 
length  that  are  frequently  polyadenylated  and  associated  with  tran¬ 
scription  by  RNA  polymerase  II  (ref.  7).  IncRNA-mediated  biology 
has  been  implicated  in  a  wide  variety  of  cellular  processes,  and,  in 
cancer,  IncRNAs  are  emerging  as  a  prominent  layer  of  transcriptional 
regulation,  often  by  collaborating  with  epigenetic  complexes^"^®. 

Here  we  hypothesized  that  prostate  cancer  aggressiveness  was  gov¬ 
erned  by  uncharacterized  IncRNAs  and  sought  to  identify  IncRNAs 


associated  with  aggressive  disease.  We  previously  used  RNA  sequenc¬ 
ing  (RNA-seq)  to  describe  121  new  IncRNA  loci  (out  of  >1,800)  that 
were  aberrantly  expressed  in  prostate  cancer  tissues^k  Because  only  a 
fraction  of  prostate  cancers  present  with  aggressive  clinical  features^, 
we  performed  cancer  outlier  profile  analysis^ ^  (COPA)  to  nominate 
intergenic  IncRNAs  selectively  upregulated  in  a  subset  of  cancers 
(Supplementary  Table  1).  We  observed  that  only  two,  PCAT-109  and 
PCAT-114,  which  are  both  located  in  a  ‘gene  desert’  on  chromosome 
2q31.3  (Supplementary  Fig.  1),  had  striking  outlier  profiles  distin¬ 
guishing  them  from  the  rest  of  the  candidates'^  (Fig-  la). 

Of  these  two  IncRNAs,  PCAT-114  was  expressed  at  higher  levels  in 
prostate  cell  lines,  and,  in  the  PCAT-114  region,  we  defined  a  1.4-kb 
polyadenylated  gene  composed  of  up  to  seven  exons  and  spanning 
nearly  200  kb  on  chromosome  2q31.3  (Fig.  lb  and  Supplementary 
Fig.  2a).  We  named  this  gene  second  chromosome  locus  associated 
with  prostate- 1  (SChLAPl)  after  its  genomic  location.  Published 
prostate  cancer  chromatin  immunoprecipitation  and  sequenc¬ 
ing  (ChIP-seq)  data^^  confirmed  that  the  transcriptional  start  site 
(TSS)  of  SChLAPl  was  marked  by  trimethylation  of  histone  H3 
at  lysine  4  (H3K4me3)  and  that  its  gene  body  harbored  trimethyl¬ 
ation  of  histone  H3  at  lysine  36  (H3K36me3)  (Fig.  lb),  an  epige¬ 
netic  signature  consistent  with  IncRNAs^^.  We  observed  numerous 
SChLAPl  splicing  isoforms,  of  which  three  (termed  isoforms  1,  2 
and  3)  constituted  the  vast  majority  (>90%)  of  transcripts  in  the  cell 
(Supplementary  Fig.  2b,c). 

Using  quantitative  PCR  (qPCR),  we  confirmed  that  SChLAPl  was 
highly  expressed  in  -25%  of  prostate  cancers  (Fig.  Ic).  SChLAPl 
was  found  to  be  expressed  more  frequently  in  metastatic  compared 
to  localized  prostate  cancers,  and  its  expression  was  associated  with 
ETS  gene  fusions  in  this  cohort  but  not  with  other  molecular  events 
(Supplementary  Fig.  2d,e).  A  computational  analysis  of  the  SChLAPl 
sequence  suggested  no  coding  potential,  which  was  confirmed 
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Figure  1  Identification  of  SChLAPI  as  a  prostate  cancer-associated  IncRNA.  (a)  COPA  for  intergenic  IncRNAs  (IncRNAs  defined  in  ref.  11). 

(b)  Representation  of  the  SChLAPI  gene  and  its  annotations  in  current  databases.  An  aggregated  representation  of  current  gene  annotations  for 
EnsembI,  the  Encyclopedia  of  DNA  Elements  (ENCODE),  the  UCSC  Genome  Browser,  RefSeq  and  Vega  shows  no  annotation  for  SChLAPI.  ChIP-seq 
data  for  H3K4me3  and  H3K36me3  show  enrichment  at  the  SChLAPI  gene.  Also,  RNA-seq  data  showing  an  outlier  sample  for  SChLAPI  demonstrate 
its  expression,  (c)  qPCR  for  SChLAPI  on  a  panel  of  benign  prostate  {n  =  33),  localized  prostate  cancer  {n  =  82)  and  metastatic  prostate  cancer  {n  =  33) 
samples.  qPCR  data  are  normalized  to  the  average  of  {GAPDH  HMBS)  and  are  represented  as  standardized  expression  values,  (d)  Fractionation  of 
prostate  cell  lysates  demonstrates  nuclear  expression  of  SChLAPI.  U1  RNA  serves  as  a  positive  control  for  nuclear  gene  expression.  Error  bars,  s.e.m. 

(e)  In  situ  hybridization  of  SChLAPI  in  human  prostate  cancer.  Histological  scores  for  localized  cancer  samples  are  indicated,  with  the  first  number 
representing  the  major  Gleason  score  and  the  second  number  representing  the  minor  Gleason  score.  SChLAPI  staining  is  shown  for  both  localized  and 
metastatic  tissues. 


experimentally  by  in  vitro  translation  assays  of  the  three  SChLAPI  iso¬ 
forms  (Supplementary  Fig.  3).  Additionally,  we  found  that  SChLAPI 
transcripts  were  located  in  the  nucleus  (Fig.  Id).  We  confirmed 
the  nuclear  localization  of  SChLAPI  transcripts  in  human  samples 
(Fig.  le)  using  an  in  situ  hybridization  assay  in  formalin-fixed, 
paraffin-embedded  prostate  cancer  samples  (Supplementary 
Fig.  4a,b  and  Supplementary  Note). 

An  analysis  of  SChLAPI  expression  in  localized  tumors  dem¬ 
onstrated  a  strong  correlation  with  higher  Gleason  scores,  a  his- 
topathological  measure  of  aggressiveness  (Supplementary  Fig.  4c,d 
and  Supplementary  Table  2).  Next,  we  performed  a  network  analy¬ 
sis  of  prostate  cancer  microarray  data  in  the  Oncomine^^  database 


using  signatures  of  SC/zLAPi -correlated  or  SC/zLAPi- anticorrelated 
genes,  as  SChLAPI  itself  is  not  measured  by  expression  microarrays 
(Online  Methods  and  Supplementary  Table  3a).  We  found  a  striking 
association  with  enriched  concepts  related  to  prostate  cancer  pro¬ 
gression  (Fig.  2a  and  Supplementary  Table  3b).  For  comparison, 
we  next  incorporated  disease  signatures  using  prostate  RNA-seq 
data  and  additional  known  prostate  cancer  genes,  including  EZH2 
(a  metastasis  gene^^),  PCA3  (a  IncRNA  biomarker^)  and  AMACR 
(a  tissue  biomarker"^),  as  well  as  ACTB  (encoding  p-actin)  as  a  control 
(Supplementary  Fig.  5,  Supplementary  Table  3c-i  and  Supplementary 
Note).  A  heatmap  visualization  of  significant  comparisons  con¬ 
firmed  a  strong  association  of  SC/zLAPi -correlated  genes  but  not  of 
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Figure  2  SChLAPI  expression  characterizes  aggressive  prostate  cancer,  (a)  Network  representation 
of  genes  positively  or  negatively  correlated  with  SChLAPI  in  localized  prostate  cancers  using 
Oncomine  concepts  analysis  and  visualized  with  the  Force-Directed  Layout  algorithm  in  the 
Cytoscape^o  tool.  The  top  panel  displays  genes  positively  correlated  with  SChLAPI.  The  bottom 
panel  displays  genes  negatively  correlated  with  SChLAPI.  Node  sizes  reflect  the  number  of  genes  jg  ^  o.4 

that  comprise  each  molecular  concept,  and  node  names  are  assigned  according  to  the  author  of  the  |  i 

primary  study^^e, 17,37-45  gs  detailed  in  Supplementary  Table  3b.  Nodes  are  colored  according  to  the  | 

concept  categories  indicated  in  the  figure  legend.  Edges  are  drawn  between  nodes  with  statistically  ^  0 

significant  enrichment  (P<  1  X  10“^;  odds  ratio  >  3.0),  and  darker  edge  shading  implies  higher  0  2  4  6  8  10  12 

odds  ratio,  (b)  Heatmap  representation  of  comparisons  between  coexpression  gene  signatures  and  Time  (years) 

molecular  concepts.  Comparisons  to  positively  (top)  and  negatively  (bottom)  correlated  gene  signatures  are 

shown  separately.  Comparisons  that  do  not  reach  statistical  significance  {q  >  0.01  or  odds  ratio  <  2)  are  shown  in  gray.  Associations  with  overexpression 
concepts  are  colored  red,  and  underexpression  concepts  are  colored  blue,  (c-e)  Kaplan-Meier  analyses  of  prostate  cancer  outcomes  in  the  Mayo  Clinic 
cohort.  SChLAPI  expression  was  measured  using  Affymetrix  exon  arrays,  and  subjects  were  stratified  according  to  their  SChLAPI  expression  level. 
Subject  outcomes  were  analyzed  for  biochemical  recurrence  (c),  clinical  progression  to  systemic  disease  (d)  and  prostate  cancer-specific  mortality  (e). 
The  shaded  regions  represent  95%  confidence  intervals.  P  values  for  Kaplan-Meier  curves  were  determined  using  a  log-rank  test. 


PCA3-  and  AMACi^- correlated  genes  with  high-grade  and  metastatic 
cancers  (Fig.  2b).  Kaplan-Meier  analysis  similarly  showed  significant 
associations  between  the  SChLAPI  signature  and  biochemical  recur¬ 
rence^^  and  overall  survivaF^  (Supplementary  Fig.  6a, b). 

To  directly  evaluate  the  relationship  between  SChLAPI  levels  and 
clinical  outcome,  we  next  used  SChLAPI  expression  to  stratify  235 
samples  from  individuals  with  localized  prostate  cancer  who  under¬ 
went  radical  prostatectomy  at  the  Mayo  Clinic^^  (Online  Methods 
and  Supplementary  Fig.  6c).  We  evaluated  samples  for  three  clinical 
endpoints:  biochemical  recurrence,  clinical  progression  to  systemic 
disease  and  prostate  cancer-specific  mortality  (Supplementary 
Table  4).  At  the  time  of  this  analysis,  subjects  had  a  median  follow-up 
time  of  8.1  years. 

SChLAPI  was  a  powerful  single-gene  predictor  of  aggressive  pros¬ 
tate  cancer  (Fig.  2c-e).  SChLAPI  expression  was  highly  significant 
when  distinguishing  disease  with  clinical  progression  and  prostate 
cancer-specific  mortality  (P  =  0.00005  and  0.002,  respectively; 
Fig.  2d,e).  For  the  biochemical  recurrence  endpoint,  high  SChLAPI 


expression  was  associated  with  a  shorter  median  time  to  progres¬ 
sion  (1.9  versus  5.5  years  for  individuals  with  high  and  low  expres¬ 
sion  of  SChLAPI  y  respectively;  Fig.  2c).  We  further  confirmed  this 
association  with  rapid  biochemical  recurrence  using  an  independent 
cohort  (Supplementary  Fig.  6d).  Multivariate  and  univariate  regres¬ 
sion  analyses  of  the  Mayo  Clinic  data  demonstrated  that  SChLAPI 
expression  is  an  independent  predictor  of  prostate  cancer  aggressive¬ 
ness,  with  highly  significant  hazard  ratios  for  predicting  biochemical 
recurrence,  clinical  progression  and  prostate  cancer-specific  mortality 
(hazard  ratios  of  3.045, 3.563  and  4.339,  respectively;  P  <  0.01),  which 
are  comparable  to  those  for  other  clinical  factors  such  as  advanced 
clinical  stage  and  Gleason  histopathological  score  (Supplementary 
Fig.  7  and  Supplementary  Note). 

To  explore  the  functional  role  of  SChLAPI  y  we  performed  small 
interfering  RNA  (siRNA)-mediated  knockdowns  to  compare  the 
impact  of  SChLAPI  depletion  to  that  of  EZH2y  which  is  essential 
for  cancer  cell  aggressiveness^^.  Notably,  knockdown  of  SChLAPI 
dramatically  impaired  cell  invasion  and  proliferation  in  vitro  to 
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Figure  3  SChLAPI  coordinates  cancer  cell  invasion  in  vitro  and  metastatic  seeding  in  vivo. 

(a)  siRNA  knockdown  of  SChLAPI  in  vitro  in  three  prostate  cell  lines  (LNCaP,  22Rvl,  Dul45) 
impairs  cellular  invasion  through  Matrigel  in  a  Boyden  chamber  assay.  EZH2  siRNA  serves  as  a 
positive  control  for  impaired  invasion.  Images  to  the  right  show  representative  Boyden  chamber 
membranes  stained  with  crystal  violet  following  invasion.  All  images  were  captured  at  the  same 
magnification,  as  indicated  by  the  scale  bars,  (b)  Overexpression  of  SChLAPI  in  RWPE  cells 
results  in  increased  cellular  invasion  through  Matrigel  in  Boyden  chamber  assays.  Images  to 
the  right  show  Boyden  chambers  as  in  a  in  indicated  RWPE  cell  lines;  scale  bars  represent 
equal  magnification  of  images,  (c)  Intracardiac  injection  of  severe  combined  immunodeficient 
(SCID)  mice  with  22Rvl  cells  with  stable  SChLAPI  knockdown.  Representative  luciferase 
bioluminescence  images  (BLI)  are  shown  for  mice  5  weeks  after  intracardiac  injection  with  22Rvl 
cells  expressing  non-targeting  control  shRNA  {n  =  9),  SChLAPI  shRNA  1  {n=  14)  or  SChLAPI 
shRNA  2  {n  =  14).  Mouse  IDs  are  given  above  each  image,  (d)  Relative  intensity  of  whole-mouse 
luciferase  signal  is  plotted  for  mice  with  intracardiac  injection  of  22Rvl  cells  expressing  control, 
non-targeting  shRNA  {n  =  9),  SChLAPI  shRNA  1  {n  =  14)  or  SChLAPI  shRNA  2  {n  =  14). 

(e)  Number  of  gross  metastatic  sites  observed  by  luciferase  signal  in  mice  injected  with  22Rvl 
cells  expressing  SChLAPI  shRNA  or  control,  non-targeting  shRNA.  Independent  foci  with 
luciferase  signal  were  averaged  for  mice  injected  with  22Rvl  cells  expressing  control  shRNA 
in  =  9),  SChLAPI  shRNA  1  {n  =  14)  or  SChLAPI  shRNA  2{n=  14).  (f)  Invasion  of  22Rvl  cells 
expressing  non-targeting  or  SChLAPI  shRNA  in  chick  CAM  assays.  22Rvl  cells  are  labeled  with 
GFP.  Images  are  counterstained  for  chicken  collagen  IV  to  visualize  vasculature  (REP)  and  for 
DAPI  to  visualize  nuclei.  Scale  bars  represent  arbitrary  units  indicating  equal  magnification  of 
images,  (g-i)  In  the  CAM  assay,  22Rvl  cells  expressing  SChLAPI  shRNA  demonstrate  decreased 
intravasation  (g),  metastatic  spread  to  the  liver  and  lungs  (h)  and  reduced  tumor  weight  (i)  relative 

to  22Rvl  cells  expressing  control,  non-targeting  shRNA.  (j)  Quantification  of  intravasation  of  RWPE  cells  expressing  LacZor  SChLAPI  in  the  CAM 
assay.  All  data  in  bar  plots  are  represented  as  mean  ±  s.e.m.  Statistical  significance  was  determined  by  two-tailed  Student’s  f  test:  *P<  0.05.  Box  plots 
in  e,i  display  box-and-whisker  plots  with  the  midpoint  line  indicating  the  median,  box  boundaries  showing  25th  and  75th  quartile  ranges  and  whiskers 
displaying  the  minimum  and  maximum  values. 


A 


an  extent  comparable  to  that  observed  with  knockdown  of  EZH2 
(Fig.  3a  and  Supplementary  Fig.  8a, b).  Overexpression  of  an  siRNA- 
resistant  SChLAPI  isoform  rescued  the  in  vitro  invasive  phenotype 
of  22Rvl  cells  treated  with  siRNA-2  (Supplementary  Fig.  8c, d). 
Overexpression  of  the  three  SChLAPI  isoforms  in  benign,  immor¬ 
talized  RWPE  prostate  cells  dramatically  increased  the  ability  of  these 
cells  to  invade  in  vitro  but  did  not  affect  cell  proliferation  (Fig.  3b  and 
Supplementary  Fig.  8e,f). 

To  test  SChLAPI  in  vivo,  we  performed  intracardiac  injection  of 
CB-17  SCID  mice  with  22Rvl  cells  stably  knocking  down  SChLAPI 
(Supplementary  Fig.  9a)  and  observed  that  SChLAPI  depletion 
impaired  metastatic  seeding  and  growth,  as  measured  by  luciferase 
signaling  at  both  proximal  (lungs)  and  distal  sites  (Fig.  3c,d).  Indeed, 
compared  to  mice  injected  with  22Rvl  cells  expressing  a  non-targeting 
control,  mice  injected  with  22Rvl  cells  stably  expressing  short 


hairpin  RNA  (shRNA)  against  SChLAPI  had  both  fewer  gross  metastatic 
sites  overall  as  well  as  smaller  metastatic  tumors  when  they  did  form 
(Fig.  3d,e).  Histopathological  analysis  of  the  metastatic  22Rvl  tumors, 
regardless  of  SChLAPI  knockdown,  showed  uniformly  high-grade  epi¬ 
thelial  cancer  (Supplementary  Fig.  9b).  Interestingly,  subcutaneous 
xenografts  with  stable  knockdown  of  SChLAPI  showed  slower  tumor 
progression;  however,  this  was  due  to  delayed  tumor  engraftment  rather 
than  to  decreased  tumor  growth  kinetics,  with  no  change  in  Ki67  stain¬ 
ing  observed  between  cells  expressing  SChLAPI  shRNA  and  control 
cells  expressing  non-targeting  shRNA  (Supplementary  Fig.  9c-i). 

Next,  using  the  chick  chorioallantoic  membrane  (CAM)  assay^^, 
we  found  that  22Rvl  cells  expressing  SChLAPI  shRNA  2,  which  have 
depleted  expression  of  both  isoforms  1  and  2,  had  greatly  reduced 
ability  to  invade,  intravasate  and  metastasize  to  distant  organs 
(Fig.  3f-h).  Additionally  cells  with  knockdown  of  SChLAPI  also 
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Figure  4  SChLAPI  antagonizes  SNF5  function  and  attenuates  SNF5  genome-wide  localization,  (a)  Fleatmap  results  for  SChLAPI  or  SMARCBl 
knockdown  in  LNCaP  and  22Rvl  cells.  The  numbers  above  the  heatmap  indicate  the  specific  siRNA  and  microarray  replicates,  (b)  RIP  of  SNF5  or 
androgen  receptor  (AR)  in  22Rvl  and  LNCaP  cells.  Inset,  protein  blots  showing  pulldown  efficiency.  Error  bars,  s.e.m.  (c)  RIP  analysis  of  SNF5  in 
RWPE  cells  overexpressing  LacZ,  SChLAPI  isoform  1  or  SChLAPI  isoform  2.  Inset,  protein  blots  showing  pulldown  efficiency.  Error  bars,  s.e.m. 

(d)  Pulldown  of  SChLAPI  RNA  using  chromatin  isolation  by  RNA  purification  (ChIRP)  recovers  SNF5  protein  in  RWPE  cells  expressing  SChLAPI 
isoform  1.  LacZ ar\6  TERC  serve  as  negative  and  positive  controls,  respectively,  (e)  Global  representation  of  SNF5  genomic  binding  over  a  4-kb  window 
centered  on  each  SNF5  ChIP-seq  peak  in  RWPE  cells  expressing  LacZ,  SChLAPI  isoform  1  or  SChLAPI  isoform  2.  (f)  Heatmap  of  SNF5  genomic 
binding  at  target  sites  in  RWPE  cells  expressing  LacZ,  SChLAPI  isoform  1  or  SChLAPI  isoform  2.  A  2-kb  interval  centered  on  the  called  SNF5  peak 
is  shown,  (g)  GSEA  results  showing  significant  enrichment  of  ChIP-seq  promoter  peaks  with  >2-fold  loss  of  SNF5  binding  for  underexpressed  genes  in 
RWPE  cells  expressing  SChLAPI.  NES,  normalized  enrichment  score;  FWER,  familywise  error  rate. 


resulted  in  decreased  tumor  growth  (Fig.  3i).  Notably,  RWPE  cells  with 
overexpression  of  SChLAPI  isoform  1  partially  supported  these  results, 
showing  a  markedly  increased  ability  to  intravasate  (Fig.  3j).  RWPE 
cells  overexpressing  SChLAPI  did  not  generate  distant  metastases  or 
cause  altered  tumor  growth  in  this  model  (data  not  shown).  Together, 
the  mouse  metastasis  and  CAM  data  strongly  implicate  SChLAPI  in 
tumor  invasion  and  metastasis  through  activity  in  cancer  cell  intrava- 
sation,  extravasation  and  subsequent  tumor  cell  seeding. 

To  elucidate  the  mechanisms  of  SChLAPI  function,  we  profiled 
22Rvl  and  LNCaP  cells  with  SChLAPI  knockdown,  identifying 
165  upregulated  and  264  downregulated  genes  (q  value  <  0.001) 
(Supplementary  Fig.  10a  and  Supplementary  Table  5a).  After  rank¬ 
ing  genes  according  to  differential  expressions^,  we  employed  Gene 
Set  Enrichment  Analysis  (GSEA)Si  to  search  for  enrichment  across 


the  Molecular  Signatures  Database  (MSigDB)^^.  Among  the  highest 
ranked  concepts,  we  noticed  genes  positively  or  negatively  correlated 
with  the  SWI/SNF  complex^^,  and  this  association  was  independ¬ 
ently  confirmed  using  gene  signatures  generated  from  our  RNA-seq 
data  (Supplementary  Fig.  lOb-e  and  Supplementary  Table  5b,c). 
Notably,  SC/zLAPi -regulated  genes  were  inversely  correlated  with 
these  data  sets,  suggesting  that  SChLAPI  and  the  SWI/SNF  complex 
function  in  opposing  manners. 

The  SWI/SNF  complex  regulates  gene  transcription  as  a  multipro¬ 
tein  system  that  physically  moves  nucleosomes  at  gene  promoters^"^. 
Loss  of  SWI/SNF  complex  functionality  promotes  cancer  progression, 
and  multiple  SWI/SNF  components  are  somatically  inactivated  in  can- 
cer24>25  SWI/SNF  complex  mutations  do  occur  in  prostate  cancer,  albeit 
not  commonly^,  and  downregulation  of  SWI/SNF  complex  members 


1396 


VOLUME  45  I  NUMBER  1 1  |  NOVEMBER  201  3  NATURE  GENETICS 


2013  Nature  America,  Inc.  All  rights  reserved. 


LETTERS 


@ 


characterizes  subsets  of  prostate  cancer^^*^^.  Thus,  antagonism 
of  SWI/SNF  complex  activity  by  SChLAPl  is  consistent  with  the  onco¬ 
genic  behavior  of  SChLAPl  and  the  tumor  suppressive  behavior  of 
the  SWI/SNF  complex. 

To  directly  test  whether  SChLAPl  antagonizes  SWI/SNF-mediated 
regulation,  we  performed  siRNA-mediated  knockdown  of  SMARCBl 
(which  encodes  the  SNF5  protein)  (Supplementary  Fig.  lOf),  an 
essential  subunit  that  facilitates  SWI/SNF  complex  binding  to  histone 
proteins^^’^^’^^,  and  confirmed  predicted  expression  changes  for  sev¬ 
eral  SChLAPl-  or  SNF5-regulated  genes  (Supplementary  Fig.  10g,h). 
A  comparison  of  genes  whose  expression  was  altered  by  knockdown  of 
SMARCBl  to  those  regulated  by  SC/zLAPi  demonstrated  an  antagonistic 
relationship  in  which  SChLAPl  knockdown  affected  the  same  genes 
as  SMARCBl  knockdown  but  with  opposing  directions  of  effect 
(Fig.  4a  and  Supplementary  Table  5d-h).  We  used  GSEA  to  quan¬ 
tify  and  verify  the  significance  of  these  findings  (false  discovery  rate 
(FDR)  <  0.05)  (Supplementary  Fig.  lOi-k).  Furthermore,  a  shared 
SMARCBl -SChLAPl  signature  of  coregulated  genes  was  highly 
enriched  for  prostate  cancer  clinical  signatures  for  disease  aggres¬ 
siveness  (Supplementary  Fig.  11  and  Supplementary  Table  5i). 

Mechanistically,  although  SChLAPl  and  SMARCBl  mRNA  lev¬ 
els  were  comparable  (Supplementary  Fig.  12a),  SChLAPl  knock¬ 
down  or  overexpression  did  not  alter  SNF5  protein  abundance 
(Supplementary  Fig.  12b),  suggesting  that  SChLAPl  regulates 
SWI/SNF  activity  post-translationally.  To  explore  this  possibility, 
we  performed  RNA  immunoprecipitation  assays  (RIPs)  for  SNF5. 
We  found  that  endogenous  SChLAPl  but  not  other  cytoplasmic 
or  nuclear  IncRNAs^’^^  robustly  coimmunoprecipitated  with  SNF5 
under  native  conditions  (Fig.  4b)  and  with  use  of  UV  cross-linking 
(Supplementary  Fig.  12c),  and  coimmunoprecipitation  was  also 
observed  with  a  second  antibody  to  SNF5  (Supplementary  Fig.  12d). 
In  contrast,  SChLAPl  did  not  coimmunoprecipitate  with  androgen 
receptor  (Fig.  4b).  Furthermore,  both  SChLAPl  isoform  1  and  isoform 
2  coimmunoprecipitated  with  SNF5  in  RWPE  overexpression  mod¬ 
els  (Fig.  4c  and  Supplementary  Fig.  12e).  SNRNP70  binding  to  U1 
RNA  was  used  as  a  technical  control  in  all  cell  lines  (Supplementary 
Fig.  12f,g).  Finally,  pulldown  of  SChLAPl  RNA  in  RWPE  cells  over¬ 
expressing  SChLAPl  isoform  1  robustly  recovered  SNF5  protein,  con¬ 
firming  this  interaction  (Fig.  4d  and  Supplementary  Fig.  12h). 

To  address  whether  SChLAPl  modulates  SWI/SNF  genomic 
binding,  we  performed  ChIP-seq  for  SNF5  in  RWPE  cells  express¬ 
ing  LacZ  or  SChLAPl  and  called  significantly  enriched  peaks 
with  respect  to  an  IgG  control  (Online  Methods  and  Supplementary 
Table  6a).  Protein  blot  validation  confirmed  SNF5  pulldown  by  ChIP 
(Supplementary  Fig.  13a).  After  aggregating  called  peaks  from 
all  samples,  we  found  6,235  genome-wide  binding  sites  for  SNF5 
(FDR  <  0.05;  Supplementary  Table  6b),  which  were  highly  enriched 
for  sites  near  gene  promoters  (Supplementary  Fig.  13b),  supporting 
results  from  previous  studies  of  SWI/SNF  binding^^“^k 

A  comparison  of  SNF5  binding  across  these  6,235  genomic  sites 
demonstrated  a  dramatic  decrease  in  SNF5  genomic  binding  as  a  result 
of  SChLAPl  over  expression  (Fig.  4e,f  and  Supplementary  Fig.  13c).  Of 
the  1,299  SNF5  peaks  occurring  within  1  kb  of  a  gene  TSS,  390  showed 
relative  SNF5  binding  that  was  decreased  by  >2-fold  with  SChLAPl 
overexpression  (Supplementary  Fig.  13d  and  Supplementary 
Table  6c).  To  verify  these  findings  independently,  we  performed 
Chip  for  SNF5  in  22Rvl  cells  expressing  shRNA  to  SChLAPl,  with 
the  hypothesis  that  knockdown  of  SChLAPl  should  increase  SNF5 
genomic  binding  compared  to  controls.  We  found  that  9  of  12  target 
genes  showed  a  substantial  increase  in  SNF5  binding  with  knockdown 
of  SChLAPl  (Supplementary  Fig.  14a),  confirming  our  predictions. 


Finally,  we  used  expression  profiling  of  RWPE  cells  expressing  LacZ 
or  SChLAPl  to  characterize  the  relationship  between  SNF5  binding 
and  SC/zLAPi -mediated  changes  in  gene  expression.  After  identify¬ 
ing  a  gene  signature  with  highly  significant  changes  in  expression 
(Supplementary  Table  6d),  we  intersected  this  signature  with  the 
ChIP-seq  data.  We  observed  that  a  substantial  subset  of  genes  with 
>2-fold  relative  decrease  in  SNF5  genomic  binding  were  dysregu- 
lated  when  SChLAPl  was  overexpressed  (Supplementary  Fig.  14b). 
Decreased  SNF5  binding  was  primarily  associated  with  the  down- 
regulation  of  target  gene  expression  (Supplementary  Table  6e), 
although  the  SWI/SNF  complex  is  known  to  regulate  expres¬ 
sion  in  either  direction^^’^^.  Integrative  GSEA  of  the  microarray 
and  SNF5  ChIP-seq  data  demonstrated  significant  enrichment 
for  genes  that  were  repressed  when  SChLAPl  was  overexpressed 
{q  value  =  0.003;  Fig.  4g).  Overall,  these  data  argue  that  SChLAPl 
overexpression  antagonizes  SWI/SNF  complex  function  by  attenuat¬ 
ing  the  genomic  binding  of  this  complex,  thereby  impairing  its  ability 
to  properly  regulate  gene  expression. 

Here  we  have  discovered  SChLAPl,  a  highly  prognostic  IncRNA 
that  is  abundantly  expressed  in  -25%  of  prostate  cancers  and  that 
aids  in  the  discrimination  of  aggressive  tumors  from  indolent  forms 
of  the  disease.  Mechanistically,  we  find  that  SChLAPl  coordinates 
cancer  cell  invasion  in  vitro  and  metastatic  spread  in  vivo.  Moreover, 
we  characterize  an  antagonistic  SC/zLAPi -SWI/SNF  axis  in  which 
SChLAPl  impairs  SNF5-mediated  regulation  of  gene  expression  and 
genomic  binding  (Supplementary  Fig.  14c).  Thus,  whereas  other 
IncRNAs  such  as  HOTAIR  and  HOTTIP  are  known  to  assist  epige¬ 
netic  complexes  such  as  PRC2  and  MEL  by  facilitating  their  genomic 
binding  and  enhancing  their  functions^’^’^^,  SChLAPl  is  the  first 
IncRNA,  to  our  knowledge,  that  impairs  a  major  epigenetic  complex 
with  well-documented  tumor  suppressor  function^^"^^’^^"^^.  Our 
discovery  of  SChLAPl  has  broad  implications  for  cancer  biology  and 
provides  supporting  evidence  for  the  role  of  IncRNAs  in  the  progres¬ 
sion  of  aggressive  cancers. 

URLs.  Stellaris  probe  designer,  http://www.singlemoleculefish.com; 
HT-Seq,  http://www-huber.embl.de/users/anders/HTSeq/;  Bio  Venn, 
http://www.cmbi.ru.nl/ cdd/biovenn/ ;  Galaxy,  http:/ / usegalaxy.org/. 

METHODS 

Methods  and  any  associated  references  are  available  in  the  online 
version  of  the  paper. 

Accession  codes.  Sequences  for  SChLAPl  isoforms  1-7  have  been 
deposited  in  GenBank  under  accessions  JXl  17418,  JXl  17419, 
JXl  17420,  JXl  17421,  JXl  17422,  JXl  17423  and  JXl  17424.  Microarray 
data  have  been  deposited  in  the  Gene  Expression  Omnibus  (GEO) 
under  accession  GSE40386. 

Note:  Any  Supplementary  Information  and  Source  Datafiles  are  available  in  the 
online  version  of  the  paper. 
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ONLINE  METHODS 

Cell  lines.  All  cell  lines  were  obtained  from  the  American  Type  Culture 
Collection.  Cell  lines  were  maintained  using  standard  media  and  conditions. 
Specifically,  VCaP  and  Dul45  cells  were  maintained  in  DMEM  (Invitrogen) 
supplemented  with  10%  FBS  and  1%  penicillin-streptomycin.  LNCaP  and 
22Rvl  cells  were  maintained  in  RPMI  1640  (Invitrogen)  supplemented  with 
10%  FBS  and  1%  penicillin-streptomycin.  RWPE  cells  were  maintained  in 
KSF  medium  (Invitrogen)  supplemented  with  10  ng/ml  epidermal  growth 
factor  (EGF;  Sigma)  and  bovine  pituitary  extract  (BPE)  and  with  1%  penicillin- 
streptomycin.  All  cell  lines  were  grown  at  37  °C  in  a  5%  CO2  cell  culture  incu¬ 
bator.  All  cell  lines  were  genotyped  for  identity  at  the  University  of  Michigan 
Sequencing  Core  and  were  tested  routinely  for  Mycoplasma  contamination. 

Cell  lines  expressing  SChLAPl  or  control  constructs  were  generated  by 
cloning  SChLAPl  or  control  sequence  into  the  pFentib  vector  (Invitrogen), 
using  pcr8  non-directional  Gateway  cloning  (Invitrogen)  as  an  initial  clon¬ 
ing  vector,  and  shuttling  to  pFentib  using  FR  clonase  II  (Invitrogen)  accord¬ 
ing  to  the  manufacturer’s  instructions.  Stably  transfected  RWPE  and  22Rvl 
cells  were  selected  with  blasticidin  (Invitrogen)  for  I  week.  For  FNCaP  and 
22RvI  cells  with  stable  knockdown  of  SChLAPl,  cells  were  transfected  with 
lentiviral  constructs  encoding  SChLAPl  shRNA  or  with  non-targeting  shRNA 
lentiviral  constructs  for  48  h.  GFP-positive  cells  were  selected  with  1  |lg/ml 
puromycin  for  72  h.  All  lentiviruses  were  generated  by  the  University  of 
Michigan  Vector  Core. 

Tissue  samples.  Prostate  tissues  were  obtained  from  the  radical  prostatec¬ 
tomy  series  and  Rapid  Autopsy  Program  at  the  University  of  Michigan  tissue 
core^^.  These  programs  are  part  of  the  University  of  Michigan  Prostate  Cancer 
Specialized  Program  of  Research  Excellence  (SPORE).  All  tissue  samples  were 
collected  with  informed  consent  under  an  institutional  review  board  (IRB)- 
approved  protocol  at  the  University  of  Michigan  (SPORE  in  Prostate  Cancer 
(Tissue/Serum/Urine)  Bank  Institutional  Review  Board  1994-0481). 

RNA  isolation  and  cDNA  synthesis.  Total  RNA  was  isolated  using  TRIzol 
(Invitrogen)  and  an  RNeasy  kit  (Qiagen)  with  DNase  I  digestion  according 
to  the  manufacturers’  instructions.  RNA  integrity  was  verified  on  an  Agilent 
Bioanalyzer  2100  (Agilent  Technologies).  cDNA  was  synthesized  from  total 
RNA  using  Superscript  III  (Invitrogen)  and  random  primers  (Invitrogen). 

Quantitative  RT-PCR.  Quantitative  RT-PCR  was  performed  using  Power 
SYBR  Green  MasterMix  (Applied  Biosystems)  on  an  Applied  Biosystems 
7900HT  Real-Time  PCR  System.  All  oligonucleotide  primers  were  obtained 
from  Integrated  DNA  Technologies  (IDT),  and  primer  sequences  are  listed  in 
Supplementary  Table  7a.  The  housekeeping  genes  GAPDH,  HMBS  and  ACTB 
were  used  as  loading  controls.  Fold  changes  were  calculated  relative  to  house¬ 
keeping  genes  and  were  normalized  to  the  median  value  in  benign  samples. 

RT-PCR.  RT-PCR  was  performed  for  primer  pairs  using  Platinum  Taq  High- 
Fidelity  polymerase  (Invitrogen).  PCR  products  were  resolved  on  a  1.0% 
agarose  gel.  PCR  products  were  then  either  sequenced  directly  (if  only  a 
single  product  was  observed)  or  appropriate  gel  products  were  extracted 
using  a  Gel  Extraction  kit  (Qiagen)  and  cloned  into  pcr4-TOPO  vector 
(Invitrogen).  PCR  products  were  bidirectionally  sequenced  at  the  University 
of  Michigan  Sequencing  Core  using  either  gene-specific  primers  or  M13 
forward  and  reverse  primers  for  cloned  PCR  products.  All  oligonucle¬ 
otide  primers  were  obtained  from  IDT,  and  primer  sequences  are  listed  in 
Supplementary  Table  7a. 

RACE.  5'  and  3'  RACE  were  performed  using  the  GeneRacer  RLM-RACE 
kit  (Invitrogen)  according  to  the  manufacturer’s  instructions.  RACE  PCR 
products  were  obtained  using  Platinum  Taq  High-Fidelity  polymerase, 
the  supplied  GeneRacer  primers  and  the  appropriate  gene-specific  primers 
indicated  in  Supplementary  Table  7a.  RACE  PCR  products  were  separated 
on  a  1.5%  agarose  gel.  Gel  products  were  extracted  with  a  Gel  Extraction 
kit,  cloned  into  pcr4-TOPO  vectors  and  sequenced  bidirectionally  using  M13 
forward  and  reverse  primers  at  the  University  of  Michigan  Sequencing  Core. 
At  least  three  colonies  were  sequenced  for  every  RACE  PCR  product  that 
was  gel  purified. 


siRNA-mediated  knockdown.  Cells  were  plated  in  100-mm  plates  at  a  desired 
concentration  and  transfected  with  20  pM  experimental  siRNA  oligonucleo¬ 
tides  or  non-targeting  controls  twice  at  8  h  and  24  h  after  plating.  Knockdown 
was  performed  with  Oligofectamine  in  OptiMEM  medium.  Knockdown 
efficiency  was  determined  by  qPCR.  siRNA  sequences  (in  sense  orientation) 
for  knockdown  experiments  are  listed  in  Supplementary  Table  7b.  At  72  h 
after  transfection,  cells  were  trypsinized,  counted  with  a  Coulter  counter  and 
diluted  to  1  million  cells/ml. 

Overexpression.  Full-length  SChLAPl  transcript  was  amplified  from  LNCaP 
cells  and  cloned  into  the  pLentib  vector  along  with  LacZ  control  sequence. 
Insert  sequences  were  confirmed  by  Sanger  sequencing  at  the  University  of 
Michigan  Sequencing  Core.  Lentiviruses  were  generated  at  the  University  of 
Michigan  Vector  Core.  The  benign  immortalized  prostate  cell  line  RWPE  was 
infected  with  lentiviruses  expressing  SChLAPl  or  LacZ,  and  stable  pools  and 
clones  were  generated  by  selection  with  blasticidin.  Similarly,  the  immortalized 
cancer  cell  line  22Rvl  was  infected  with  lentiviruses  expressing  SChLAPl  or 
LacZ,  and  stable  pools  were  generated  by  selection  with  blasticidin. 

Cell  proliferation  assays.  At  72  h  after  transfection  with  siRNA,  cells  were 
trypsinized,  counted  with  a  Coulter  counter  and  diluted  to  1  million  cells/ml. 
For  proliferation  assays,  10,000  cells  were  plated  in  each  well  of  a  24-well  plate 
and  grown  in  regular  growth  medium.  At  48  h  and  96  h  after  plating,  cells  were 
collected  by  trypsinizing  and  counted  using  a  Coulter  counter.  All  assays  were 
performed  in  quadruplicate. 

Basement  membrane  matrix  invasion  assays.  For  invasion  assays,  cells  were 
treated  with  the  indicated  siRNAs,  and,  at  72  h  after  transfection,  cells  were 
trypsinized,  counted  with  a  Coulter  counter  and  diluted  to  1  million  cells/ml. 
Cells  were  seeded  onto  basement  membrane  matrix  (EC  matrix,  Chemicon) 
present  in  the  insert  of  a  24-well  culture  plate.  FBS  was  added  to  the  lower 
chamber  as  a  chemoattractant.  After  48  h,  the  non-invading  cells  and  EC 
matrix  were  gently  removed  with  a  cotton  swab.  Invasive  cells  located  on  the 
lower  side  of  the  chamber  were  stained  with  crystal  violet,  air  dried  and  pho¬ 
tographed.  For  colorimetric  assays,  inserts  were  treated  with  150  pi  of  10% 
acetic  acid,  and  absorbance  was  measured  at  560  nm  using  a  spectrophotom¬ 
eter  (GE  Healthcare). 

sbRNA-mediated  knockdown.  The  prostate  cancer  cell  lines  LNCaP  and 
22Rvl  were  seeded  at  50-60%  confluency  and  were  allowed  to  attach  over¬ 
night.  Cells  were  transfected  with  lentiviral  constructs  expressing  SChLAPl 
or  non-targeting  shRNA  as  described  previously  for  48  h.  GFP-positive  cells 
were  selected  with  1  pg/ml  puromycin  for  72  h.  At  48  h  after  the  start  of  selec¬ 
tion,  cells  were  collected  for  protein  and  RNA  using  RIPA  buffer  or  TRIzol, 
respectively.  RNA  was  processed  as  described  above. 

Gene  expression  profiling.  Expression  profiling  was  performed  using  the 
Agilent  Whole  Human  Genome  Oligo  Microarray  according  to  previously 
published  protocols^^.  All  samples  were  run  in  technical  triplicates,  compar¬ 
ing  knockdown  samples  treated  with  SChLAPl  siRNA  to  samples  treated  with 
non-targeting  control  siRNA.  Expression  data  were  analyzed  using  the  SAM 
method  as  described  previously^®. 

Mouse  intracardiac  and  subcutaneous  in  vivo  models.  All  experimental 
procedures  were  approved  by  the  University  of  Michigan  Committee  for  the 
Use  and  Care  of  Animals  (UCUCA). 

For  the  intracardiac  injection  model,  5x10^  cells  from  1  of  3  experimental 
cell  lines  (22Rvl-shSC/zLAPi-l  or  22Rvl-shSC/zLAPi-2  (two  cell  lines  express¬ 
ing  SChLAPl  shRNA)  or  22Rvl-shNT  (expressing  control  vector),  all  with 
luciferase  constructs  incorporated)  were  introduced  into  CB-17  SCID  mice 
at  6  weeks  of  age.  Female  mice  were  used  to  minimize  endogenous  androgen 
production  that  might  stimulate  xenografted  prostate  cells.  We  used  15  mice 
per  cell  line  to  ensure  adequate  statistical  power  to  distinguish  phenotypes 
between  groups.  Mice  used  in  these  studies  were  randomized  by  double-blind 
injection  of  cell  line  samples  into  mice  and  were  monitored  for  tumor  growth 
by  researchers  blinded  to  the  study  design.  Beginning  1  week  after  injection, 
bioluminescent  imaging  of  mice  was  performed  weekly  using  a  CCD  IVIS 
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system  with  a  50-mm  lens  (Xenogen),  and  the  results  were  analyzed  using 
Livinglmage  software  (Xenogen).  When  the  a  mouse  reached  the  determined 
end  point,  defined  as  whole-body  region  of  interest  (ROI)  of  1  x  10^®  photons, 
or  became  fatally  ill,  it  was  euthanized,  and  the  lung  and  liver  were  resected. 
Half  of  the  resected  specimen  was  placed  in  an  immunohistochemistry  cas¬ 
sette,  incubated  in  10%  buffered  formalin  phosphate  (Fisher  Scientific)  for 
24  h  and  transferred  to  70%  ethanol  until  further  analysis.  The  other  half 
of  each  specimen  was  snap  frozen  in  liquid  nitrogen  and  stored  at  -80  °C. 
A  specimen  was  disregarded  if  the  tumor  was  localized  only  in  the  heart. 
After  accounting  for  these  considerations,  there  were  9  mice  analyzed  for 
22Rvl-shNT  cells  and  14  mice  each  analyzed  for  22Rvl-shSC/zLAPi-l  and 
22Rvl-shSC/zLAPi-2  cells. 

For  the  subcutaneous  injection  model,  1  x  10^  cells  from  1  of  the  3  previ¬ 
ously  described  experimental  cell  lines  were  introduced  into  mice  (CB-17 
SCID),  aged  5-7  weeks,  with  a  Matrigel  scaffold  (BD  Matrigel  Matrix,  BD 
Biosciences)  in  the  posterior  dorsal  flank  region  {n=  10  per  cell  line).  Tumors 
were  measured  weekly  using  a  digital  caliper,  and  the  end  point  was  defined  by 
tumor  volume  of  1,000  mm^.  When  a  mouse  reached  the  end  point  or  became 
fatally  ill,  it  was  euthanized,  and  the  primary  tumor  was  resected.  The  resected 
specimen  was  divided  in  half:  one  half  was  placed  in  10%  buffer  formalin, 
and  the  other  half  was  snap  frozen.  For  histological  analyses,  formalin -fixed, 
paraffin-embedded  mouse  livers  and  lungs  were  sectioned  on  a  microtome 
into  5-jim  sections  on  glass  slides.  Slides  were  stained  with  hematoxylin 
and  eosin  using  standard  methods  and  were  analyzed  by  a  board-certified 
pathologist  (L.RK.). 

Immunoblot  analysis.  Cells  were  lysed  in  RIPA  lysis  buffer  (Sigma)  sup¬ 
plemented  with  HALT  protease  inhibitor  (Fisher).  Protein  blotting  analy¬ 
sis  was  performed  with  standard  protocols  using  polyvinylidene  difluoride 
(PVDF)  membrane  (GE  Healthcare),  and  signals  were  visualized  with 
an  enhanced  chemiluminescence  system  as  described  by  the  manufacturer 
(GE  Healthcare). 

Protein  lysates  were  boiled  in  sample  buffer,  and  10  jig  of  protein  was 
loaded  onto  an  SDS-PAGE  gel  and  run  for  separation  of  proteins.  Proteins 
were  transferred  onto  PVDE  membrane  and  blocked  for  90  min  in  block¬ 
ing  buffer  (5%  milk  in  a  solution  of  0.1%  Tween-20  in  Tris-buffered  saline 
(TBS-T)).  Membranes  were  incubated  overnight  at  4  °C  with  primary  anti¬ 
body.  After  three  washes  with  TBS-T  and  one  wash  with  TBS,  the  blot  was 
incubated  with  HRP-conjugated  secondary  antibody,  and  signal  was  visualized 
with  an  enhanced  chemiluminescence  system  as  described  by  the  manufac¬ 
turer.  Primary  antibodies  used  included  antibody  to  SNP5  (1:1,000  dilution; 
Millipore,  ABD22,  rabbit),  SNP5  (1:1,000  dilution;  Abeam,  ab58209,  mouse), 
p-actin  (1:5,000  dilution;  Sigma,  A5316,  mouse)  and  androgen  receptor 
(1:1,000  dilution;  Millipore,  06-680,  rabbit). 

RIP  assays.  RIP  assays  were  performed  using  a  Millipore  EZ-Magna  RIP 
RNA-Binding  Protein  Immunoprecipitation  kit  (Millipore,  17-701)  accord¬ 
ing  to  the  manufacturer’s  instructions.  RIP  PCR  was  performed  as  qPCR,  as 
described  above,  using  total  RNA  as  input  controls.  We  used  1/150  volume  of 
the  RIP  RNA  product  per  PCR  reaction.  Antibodies  used  for  RIP  included  rab¬ 
bit  polyclonal  IgG  (Millipore,  PP64)  and  antibodies  to  SNRNP70  (Millipore, 
CS203216),  SNP5  (Millipore,  ABD22,  rabbit),  SNP5  (Abeam,  ab58209, 
mouse)  and  androgen  receptor  (Millipore,  06-680,  rabbit),  and  5-7  jig 
of  antibody  was  used  per  RIP  reaction.  All  RIP  assays  were  performed  in  bio¬ 
logical  duplicate.  Eor  UV-crosslinked  RIP  experiments,  cells  were  subjected  to 
400  J  of  254  nM  UV  light  twice  and  were  then  collected  for  RIP  experiments 
as  described  above. 

Chip  assays.  ChIP  assays  were  performed  as  described  previously^  using 
antibody  for  SNP5  (Millipore,  ABD22,  rabbit)  and  rabbit  IgG  (Millipore, 
PP64B).  Briefly,  approximately  1  million  cells  were  cross-linked  per  antibody 
for  10-15  min  with  1%  formaldehyde,  and  crosslinking  was  inactivated  by 
incubation  with  0.125  M  glycine  for  5  min  at  room  temperature.  Cells  were 
rinsed  with  cold  PBS  three  times,  and  cell  pellets  were  resuspended  in  lysis 
buffer  supplemented  with  protease  inhibitors.  Chromatin  was  sonicated  to  an 
average  length  of  500  bp  and  centrifuged  to  remove  debris,  and  supernatants 
containing  chromatin  fragments  were  incubated  with  protein  A  or  protein  G 


beads  to  reduce  non-specific  binding.  Beads  were  then  removed,  and  super¬ 
natants  were  incubated  with  6  jig  of  antibody  overnight  at  4  °C.  Eresh  beads 
were  added  and  incubated  with  protein -chromatin -antibody  complexes  for 
2  h  at  4  °C,  washed  twice  with  lx  dialysis  buffer  and  four  times  with  IP  wash 
buffer,  and  eluted  in  150  |il  of  IP  elution  buffer  One-tenth  of  the  ChIP  reac¬ 
tion  was  taken  for  protein  evaluation  for  validation  of  pulldown.  Cross-linking 
was  reversed  by  incubating  eluted  products  with  0.3  M  NaCl  at  65  °C  over¬ 
night.  Chip  products  were  cleaned  with  the  USB  PrepEase  kit.  ChIP  experi¬ 
ments  were  validated  for  specificity  of  the  antibody  by  protein  blotting. 

ChIP-seq  experiments.  Paired-end  ChIP-seq  libraries  were  generated 
following  the  Illumina  ChIP-seq  protocol  with  minor  modifications. 
DNA  isolated  by  ChIP  assay  was  subjected  to  end  repair  and  A  tailing  before 
ligation  with  Illumina  adaptors.  Samples  were  purified  using  AMPure 
beads  (Beckman  Coulter)  and  PCR  enriched  with  a  combination  of  spe¬ 
cific  index  primers  and  PE2.0  primer  under  the  following  conditions:  98  °C 
(30  s),  65  °C  (30  s)  and  72  °C  (40  s,  with  the  addition  of  4  s  per  cycle).  After 
14  cycles  of  amplification,  a  final  extension  at  72  °C  for  5  min  was  carried  out. 
Barcoded  libraries  were  size  selected  using  3%  NuSieve  Agarose  gels  (Lonza) 
and  subjected  to  an  additional  PCR  enrichment  step.  Libraries  were  ana¬ 
lyzed  and  quantified  using  a  Bioanalyzer  instrument  (Agilent  Technologies) 
before  they  were  subjected  to  paired-end  sequencing  using  the  Illumina 
HiSeq  platform. 

CAM  assays.  CAM  assays  were  performed  as  previously  described^ Briefly, 
fertilized  chicken  eggs  were  incubated  in  a  rotary  humidified  incubator  at 
38  °C  for  10  d.  CAM  was  released  by  applying  a  mild  amount  of  pressure  to 
the  hole  over  the  air  sac  and  cutting  a  1-cm^  window  encompassing  a  second 
hole  near  the  allantoic  vein.  Approximately  2  million  cells  in  50  |ll  of  medium 
were  implanted  in  each  egg,  windows  were  sealed,  and  eggs  were  returned  to 
a  stationary  incubator. 

Eor  local  invasion  and  intravasation  experiments,  the  upper  and  lower 
CAMS  were  isolated  after  72  h.  Upper  CAMs  were  processed  and  stained  for 
chicken  collagen  IV  (immunofluorescence)  or  human  cytokeratin  (immuno¬ 
histochemistry)  as  previously  described^^. 

Eor  metastasis  assays,  embryonic  livers  were  isolated  on  day  18  of  embry¬ 
onic  growth  and  analyzed  for  the  presence  of  tumor  cells  by  quantitative 
human  Alu-specific  PCR.  Genomic  DNA  isolates  from  lower  CAMs  and 
livers  were  prepared  using  the  Puregene  DNA  purification  system  (Qiagen), 
and  quantification  with  human  Alu-specific  PCR  was  performed  as  described^^. 
Eluorogenic  TaqMan  qPCR  probes  were  generated  as  described  above  and  used 
to  determine  DNA  copy  number. 

Eor  xenograft  growth  assays  with  RWPE  cells,  embryos  were  sacrificed  on 
day  18,  and  extraembryonic  xenografts  were  excised  and  weighed. 

In  situ  hybridization.  In  situ  hybridization  assays  were  performed  as  a  com¬ 
mercial  service  from  Advanced  Cell  Diagnostics,  Inc.  Briefly,  cells  in  the  clini¬ 
cal  specimens  were  fixed  and  permeablized  using  xylene,  ethanol  and  protease 
to  allow  for  probe  access.  Slides  were  boiled  in  pretreatment  buffer  for  15  min 
and  rinsed  in  water.  Next,  two  independent  target  probes  were  hybridized  to 
SChLAPl  RNA  at  40  °C  for  2  h,  with  this  pair  of  probes  creating  a  binding  site 
for  a  preamplifier.  After  this  incubation,  the  preamplifier  was  hybridized  to  the 
target  probes  at  30  °C  and  amplified  with  six  cycles  of  hybridization  followed 
by  two  washes.  Cells  were  counterstained  to  visualize  signal.  Einally,  slides 
were  stained  with  hematoxylin  and  eosin,  dehydrated  with  100%  ethanol  and 
xylene  and  mounted  in  a  xylene -based  mounting  medium. 

In  vitro  translation.  Eull-length  SChLAPl,  PCAT-1  or  GUS  positive  con¬ 
trol  sequences  were  cloned  into  the  PCR2.1  entry  vector  (Invitrogen).  Insert 
sequences  were  confirmed  by  Sanger  sequencing  at  the  University  of  Michigan 
Sequencing  Core.  In  vitro  translation  assays  were  performed  with  the  TnT 
Quick  Coupled  Transcription/Translation  System  (Promega)  with  1  mM 
methionine  and  Transcend  Biotin-Lysyl-tRNA  (Promega)  according  to  the 
manufacturer’s  instructions. 

ChIRP  assays.  ChIRP  assays  were  performed  as  previously  described^^. 
Briefly,  antisense  DNA  probes  targeting  the  full-length  SChLAPl  sequence 
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were  designed  using  the  online  designer  at  Stellaris  (see  URLs).  Fifteen  probes 
spanning  the  entire  transcript  and  unique  to  the  SChLAPl  sequence  were 
chosen.  Additionally,  ten  probes  were  designed  against  TERC  RNA  as  a  posi¬ 
tive  control,  and  24  probes  were  designed  against  LacZ  RNA  as  a  negative 
control.  All  probes  were  synthesized  with  3'  biotinylation  (IDT).  Sequences 
of  all  probes  are  listed  in  Supplementary  Table  8.  RWPE  cells  overexpressing 
SChLAPl  isoform  1  were  grown  to  80%  confluency  in  100-mm  cell  culture 
dishes.  Two  dishes  were  used  for  each  probe  set.  Before  being  collected,  cells 
were  rinsed  with  lx  PBS  and  cross-linked  with  1%  glutaraldehyde  (Sigma) 
for  10  min  at  room  temperature.  Cross-linking  was  quenched  by  incuba¬ 
tion  with  0.125  M  glycine  for  5  min  at  room  temperature.  Cells  were  rinsed 
twice  with  lx  PBS,  collected  and  pelleted  at  1,500^  for  5  min.  Nuclei  were 
isolated  using  the  Pierce  NE-PER  Nuclear  Protein  Extraction  kit.  Nuclear 
pellets  were  resuspended  in  100  mg/ml  cell  lysis  buffer  (50  mM  Tris,  pH  7.0, 
10  mM  EDTA,  1%  SDS,  and,  added  before  use,  1  mM  dithiothreitol  (DTT), 
phenylmethylsulphonyl  fluoride  (PMSE),  protease  inhibitor  and  Superase-In 
(Invitrogen)).  Lysates  were  placed  on  ice  for  10  min  and  sonicated  using  a 
Bioruptor  (Diagenode)  at  the  highest  setting  with  30-s  on  and  45-s  off  cycles 
until  lysates  were  completely  solubilized.  Cell  lysates  were  diluted  in  twice 
the  volume  of  hybridization  buffer  (500  mM  NaCl,  1%  SDS,  100  mM  Tris, 
pH  7.0,  10  mM  EDTA,  15%  formamide,  and,  added  before  use,  DTT,  PMSE, 
protease  inhibitor  and  Superase-In),  and  100  nM  probes  were  added  to  the 
diluted  lysates.  Hybridization  was  carried  out  by  end-over-end  rotation  at 
37  °C  for  4  h.  Magnetic  streptavidin  Cl  beads  were  prepared  by  washing  three 
times  in  cell  lysis  buffer  and  were  then  added  to  each  hybridization  reaction 
at  a  concentration  of  100  pi  per  100  pmol  of  probe.  Reactions  were  incubated 
at  37  °C  for  30  min  with  end-over-end  rotation.  Bead-probe-RNA  complexes 
were  captured  with  magnetic  racks  (Millipore)  and  washed  five  times  with 
1  ml  wash  buffer  (2x  SSC,  0.5%  SDS,  fresh  PMSE  added).  After  the  final  wash, 
20%  of  the  sample  was  used  for  RNA  isolation,  and  80%  of  the  sample  was  used 
for  protein  isolation.  Eor  RNA  elution,  beads  were  resuspended  in  200  pi  of 
RNA  proteinase  K  buffer  (100  mM  NaCl,  10  mM  Tris,  pH  7.0,  1  mM  EDTA, 
0.5%  SDS)  and  1  mg/ml  proteinase  K  (Ambion).  Samples  were  incubated  at 
50  °C  for  45  min  and  then  boiled  for  10  min.  RNA  was  isolated  using  500  pi 
of  TRIzol  reagent  and  the  miRNeasy  kit  (Qiagen)  with  on-column  DNase 
digestion  (Qiagen).  RNA  was  eluted  with  10  pi  of  water  and  then  analyzed 
by  quantitative  RT-PCR  for  the  detection  of  enriched  transcripts.  Eor  protein 
elution,  beads  were  resuspended  in  three  times  the  original  volume  of  DNase 
buffer  (100  mM  NaCl,  0.1%  NP-40),  and  protein  was  eluted  with  a  cocktail 
of  100  pg/ml  RNase  A  (Sigma- Aldrich),  0.1  U/ml  RNase  H  (Epicenter)  and 
100  U/ml  DNase  I  (Invitrogen)  at  37  °C  for  30  min.  Eluted  protein  samples 
were  supplemented  with  NuPAGE  LDS  Sample  Buffer  (Novex)  and  NuPAGE 
Sample  Reducing  Agent  (Novex)  to  a  final  concentration  of  lx  each  and  then 
boiled  for  10  min  before  SDS-PAGE  protein  blot  analysis  using  an  antibody 
to  SNE5  (Millipore). 

RNA-seq  library  preparation.  Total  RNA  was  extracted  from  healthy  and 
cancer  cell  lines  and  subject  tissues,  and  RNA  quality  was  assessed  via  Agilent 
Bioanalyzer.  Transcriptome  libraries  from  the  mRNA  fractions  were  generated 
following  the  RNA-seq  protocol  (Illumina).  Each  sample  was  sequenced  in  a 
single  lane  with  the  Illumina  Genome  Analyzer  II  (with  a  40-  to  80-nt  read 
length)  or  with  the  Illumina  HiSeq  2000  (with  a  100-nt  read  length)  according  to 
published  protocols^  Eor  strand-specific  library  construction,  we  employed 
the  dUTP  method  of  second-strand  marking  as  described  previously^®. 

Statistical  analyses  for  experimental  studies.  All  data  are  presented  as  means 
±  s.e.m.  All  experimental  assays  were  performed  in  duplicate  or  triplicate. 
Statistical  analyses  shown  in  figures  represent  Eisher’s  exact  tests  or  two-tailed 
t  tests,  as  indicated.  Eor  details  regarding  the  statistical  methods  employed 
during  microarray,  RNA-seq  and  ChIP-seq  data  analysis,  see  below. 

Nomination  of  SChLAPl  as  an  outlier  using  RNA-seq  data.  We  nomi¬ 
nated  SChLAPl  as  a  prostate  cancer  outlier  as  described^  k  Briefly,  a  modi¬ 
fied  COPA  analysis  was  performed  on  the  81  tissue  samples  in  the  cohort. 
Reads  per  kilobase  per  million  mapped  reads  (RPKM)  expression  values  were 
used  and  shifted  by  1.0  to  avoid  division  by  zero.  COPA  analysis  included  the 


following  steps:  (i)  gene  expression  values  were  median  centered,  using  the 
median  expression  value  for  the  gene  across  all  samples  in  the  cohort,  which 
sets  the  gene’s  median  to  zero;  (ii)  the  median  absolute  deviation  (MAD)  was 
calculated  for  each  gene,  and  each  gene  expression  value  was  then  scaled  by  its 
MAD;  (iii)  the  80th,  85th,  90th  and  98th  percentiles  of  the  transformed  expres¬ 
sion  values  were  calculated  for  each  gene,  the  average  of  those  four  values  was 
taken,  and  genes  were  then  ranked  according  to  this  average  percentile’,  which 
generated  a  list  of  outlier  genes  arranged  by  importance;  and  (iv)  finally,  genes 
showing  an  outlier  profile  in  the  benign  samples  were  discarded. 

LNCaP  ChIP-seq  data.  Sequencing  data  from  GSE 14097  were  downloaded 
from  GEO.  Reads  from  the  LNCAP  H3K4me3  and  H3K36me3  ChIP-seq  sam¬ 
ples  were  mapped  to  human  genome  version  hgl9  using  BWA  0.5.9  (ref.  51). 
Peak  calling  was  performed  using  MACS^^  according  to  published  protocols^^. 
Data  were  visualized  using  the  UCSC  Genome  Browser^^. 

RWPE  ChIP-seq  data.  Sequencing  data  from  RWPE  SNE5  ChIP-seq 
samples  were  mapped  to  human  genome  version  hgl9  using  the  BWA  0.5.9 
algorithm^k  Although  we  performed  paired-end  sequencing,  the  ChIP-seq 
reads  were  processed  as  single-end  reads  to  adhere  to  our  preexisting  analysis 
protocol.  Basic  read  alignment  statistics  are  listed  in  Supplementary  Table  6a. 
Peak  calling  was  performed  with  respect  to  an  IgG  control  using  the  MACS 
algorithm^^.  We  bypassed  the  model-building  step  of  MACS  (using  the 
‘-nomodel’  flag)  and  specified  a  shift  size  equal  to  half  the  library  fragment 
size  determined  by  the  Agilent  Bioanalyzer  (using  the  ‘-shiftsize’  option).  Eor 
each  sample,  we  ran  the  CEAS  program  and  generated  genome-wide  reports^^. 
We  retained  peaks  with  an  EDR  less  than  5%  (peak  calling  statistics  across 
multiple  EDR  thresholds  are  shown  in  Supplementary  Table  6b).  We  then 
aggregated  SNE5  peaks  from  the  RWPE-LacZ,  RWPE-SChLAPl  isoform  1 
and  RWPE-SChLAPl  isoform  2  samples  using  the  ‘union  of  the  genomic  peak 
intervals.  We  intersected  peaks  with  RefSeq  protein- coding  genes  and  found 
that  1,299  peaks  occurred  within  1  kb  of  TSSs.  We  counted  the  number  of  reads 
overlapping  each  of  these  promoter  peaks  across  each  sample  using  a  custom 
Python  script  and  used  the  DESeq  R  package^^  version  1.6.1  to  compute  the 
normalized  fold  change  between  RWPE-LacZ  and  RWPE-SChLAPl  (both  iso¬ 
forms).  We  observed  that  389  of  the  1,299  promoter  peaks  had  at  least  a  2 -fold 
average  decrease  in  SNE5  binding.  This  set  of  389  genes  was  subsequently  used 
as  a  gene  set  for  GSEA  (Supplementary  Table  6c). 

Microarray  experiments.  We  performed  two-color  microarray  gene  expres¬ 
sion  profiling  of  22Rvl  and  LNCaP  cells  treated  with  two  independent  siRNAs 
targeting  SChLAPl  as  well  as  control  non-targeting  siRNAs.  These  profiling 
experiments  were  run  in  technical  triplicate  for  a  total  of  12  arrays  (6  from 
22Rvl  and  6  from  LNCaP).  Additionally,  we  profiled  22Rvl  and  LNCaP  cells 
treated  with  independent  siRNAs  targeting  SWI/SNE  component  SMARCBl  as 
well  as  control  non-targeting  siRNAs.  These  profiling  experiments  were  run  as 
biological  duplicates  for  a  total  of  four  arrays  (two  cell  lines  x  two  independent 
siRNAs  X  one  protein).  Einally,  we  profiled  RWPE  cells  expressing  two  dif¬ 
ferent  SChLAPl  isoforms  as  well  as  the  control  LacZ  gene.  These  profiling 
experiments  were  run  in  technical  duplicate  for  a  total  of  four  arrays  (two  from 
RWPE-SChLAPl  isoform  1  and  two  from  RWPE-SChLAPl  isoform  2). 

Processing  to  determine  ranked  gene  expression  lists.  All  of  the  microarray 
data  were  represented  as  log2  fold  change  between  targeting  versus  control 
siRNAs.  We  used  the  CollapseDataset  tool  provided  by  the  GSEA  package 
to  convert  Agilent  Probe  IDs  to  gene  symbols.  Genes  whose  expression  was 
measured  by  multiple  probes  were  consolidated  using  the  median  values 
obtained  with  these  probes.  We  then  ran  one-class  SAM  analysis  from  the 
Multi- Experiment  Viewer  application  and  ranked  all  genes  by  the  difference 
between  observed  versus  expected  statistics.  These  ranked  gene  lists  were 
imported  to  GSEA  version  2.07. 

SChLAPl  siRNA  knockdown  microarrays.  Eor  the  22Rvl  and  LNCaP 
SChLAPl  knockdown  experiments,  we  ran  the  GseaPreRanked  tool  to  discover 
enriched  gene  sets  in  MSigDB^^  version  3.0.  Lists  of  positively  and  negatively 
enriched  concepts  were  interpreted  manually. 
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SMARCBl  siRNA  knockdown  microarrays.  For  each  SMARCBl  knockdown 
experiment,  we  nominated  genes  that  were  altered  by  an  average  of  at  least 
twofold.  These  signatures  of  putative  SNF5  target  genes  were  then  used  to 
assess  enrichment  of  SChLAPl  -regulated  genes  using  the  GseaPreRanked  tool. 
Additionally,  we  nominated  genes  whose  expression  changed  by  an  average  of 
twofold  or  greater  across  SMARCBl  knockdown  experiments  and  quantified 
the  enrichment  for  SChLAPl  target  genes  using  GSEA. 

RWPE  SChLAPl  expression  microarrays.  RWPE-SChLAPl  versus  RWPE- 
LacZ  expression  profiles  were  ranked  using  SAM  analysis  as  described  above. 
A  total  of  1,245  genes  were  significantly  over-  or  underexpressed  and  are 
shown  in  Supplementary  Table  6d.  A  q  value  of  0.0  in  this  SAM  analysis 
signifies  that  no  permutation  generated  a  more  significant  difference  between 
observed  and  expected  gene  expression  ratios.  The  ranked  gene  expression 
list  was  used  as  input  for  the  GseaPreRanked  tool  and  compared  against  SNP5 
ChIP-seq  promoter  peaks  that  decreased  by  >2-fold  in  RWPE  cells  overex¬ 
pressing  SChLAPl.  Of  the  389  genes  in  the  ChIP-seq  gene  set,  250  were  pro¬ 
filed  by  the  Agilent  HumanGenome  microarray  chip  and  were  present  in  the 
GSEA  gene  symbol  database.  The  expression  profile  across  these  250  genes  is 
shown  in  Supplementary  Table  6e. 

RNA-seq  data.  We  assembled  an  RNA-seq  cohort  from  prostate  cancer  tis¬ 
sues  sequenced  at  multiple  institutions.  We  included  data  from  12  primary 
tumors  and  5  benign  tissues  published  in  GEO  (GSE22260)^^,  from  16  pri¬ 
mary  tumors  and  3  benign  tissues  released  in  the  database  of  Genotypes  and 
Phenotypes  (dbGAP)  (phs000310.vl.pl)^^  and  from  17  benign,  57  primary  and 
14  metastatic  tumors  sequenced  by  our  own  institution  and  released  in  dbGAP 
(phs000443.vl.pl).  Sample  information  is  shown  in  Supplementary  Table  la, 
and  sequencing  library  information  is  shown  in  Supplementary  Table  lb. 

RNA-seq  alignment  and  gene  expression  quantification.  Sequencing  data 
were  aligned  using  TopHat^^  version  1.3.1  against  the  Ensembl  GRCh37 
human  genome  build.  Known  introns  (Ensembl  release  63)  were  provided  to 
TopHat.  Gene  expression  across  genes  in  Ensembl  version  63  and  the  SChLAPl 
transcript  was  quantified  by  HT-Seq  version  0.5.3p3  using  the  script  ‘htseq- 
count’.  Reads  were  counted  without  respect  to  strand  to  avoid  bias  between 
unstranded  and  strand-specific  library  preparation  methods.  This  bias  results 
from  the  inability  to  resolve  reads  in  regions  where  two  genes  on  opposite 
strands  overlap  in  the  genome. 

RNA-seq  differential  expression  analysis.  Differential  expression  analysis 
was  performed  using  R  package  DESeq^^  version  1.6.1.  Read  counts  were 
normalized  using  the  estimateSizeEactors  function,  and  variance  was  modeled 
by  the  estimateDispersions  function.  Statistics  on  differential  expression  were 
computed  by  the  ‘nbinomTest’  function.  We  called  differentially  expressed 
genes  by  imposing  adjusted  P- value  cutoffs  for  cancer  versus  benign  samples 
(Padj  <  0.05),  metastasis  versus  primary  samples  (P^dj  <  0.05)  and  Gleason 
score  of  8-i-  versus  6  (Padj  <  0.10).  Heatmap  visualizations  of  these  analyses  are 
presented  as  Supplementary  Figure  5. 

RNA-seq  correlation  analysis.  Read  count  data  were  normalized  using  func¬ 
tions  from  the  R  package  DESeq  version  1.6.1.  Adjustments  for  library  size 
were  made  using  the  estimateSizeEactors’  function,  and  variance  was  modeled 
using  the  ‘estimateDispersions’  function  using  the  parameters  ‘method=blind’ 
and  ‘sharingMode=fit-only’.  Next,  raw  read  count  data  were  converted  to  pseu¬ 
docounts  using  the  ‘getVarianceStabilizedData  function.  Gene  expression 
levels  were  then  mean  centered  and  standardized  using  the  ‘scale’  function  in 
R.  Pearson’s  correlation  coefficients  were  computed  between  each  gene  of 
interest  and  all  other  genes.  Statistical  significance  of  Pearson’s  correlations 
was  determined  by  comparison  to  correlation  coefficients  achieved  with  1,000 
random  permutations  of  the  expression  data.  We  controlled  for  multiple- 
hypothesis  testing  using  the  ‘qvalue’  package  in  R.  The  SChLAPl  correlation 
signature  of  253  genes  was  determined  by  imposing  a  cutoff  of  ^  <  0.05. 

Oncomine  concepts  analysis  of  the  SChLAPl  signature.  We  separated 
the  253  genes  with  expression  levels  significantly  correlated  with  SChLAPl 
into  positively  and  negatively  correlated  gene  lists.  We  imported  these  gene 


lists  into  Oncomine  as  custom  concepts.  We  then  nominated  significantly 
associated  prostate  cancer  concepts  with  odds  ratio  >3.0  and  P  <  1  x  10“^. 
We  exported  these  results  as  the  nodes  and  edges  of  a  concept  association 
network  and  visualized  the  network  using  Cytoscape  version  2.8.2.  Node  posi¬ 
tions  were  computed  using  the  Eorce-Directed  Layout  algorithm  in  Cytoscape 
using  the  odds  ratio  as  the  edge  weight.  Node  positions  were  subtly  altered 
manually  to  enable  better  visualization  of  node  labels. 

Association  of  correlation  signatures  with  Oncomine  concepts.  We  applied 
our  RNA-seq  correlation  analysis  procedure  to  the  genes  SChLAPl,  EZH2, 
PCA3,  AMACR  and  ACTB.  Eor  each  gene,  we  created  signatures  from  the  top 
5%  of  positively  and  negatively  correlated  genes  (Supplementary  Table  3). 
We  performed  a  large  meta- analysis  of  these  correlation  signatures  across 
Oncomine  data  sets  corresponding  to  disease  outcome  (Glinsky  Prostate  and 
Setlur  Prostate),  metastatic  disease  (Holzbeierlein  Prostate,  Lapointe  Prostate, 
LaTulippe  Prostate,  Taylor  Prostate  3,  Vanaja  Prostate,  Varambally  Prostate 
and  Yu  Prostate),  advanced  Gleason  score  (Bittner  Prostate,  Glinsky  Prostate, 
Lapointe  Prostate,  LaTulippe  Prostate,  Setlur  Prostate,  Taylor  Prostate  3  and  Yu 
Prostate)  and  localized  cancer  (Arredouani  Prostate,  Holzbeierlein  Prostate, 
Lapointe  Prostate,  LaTulippe  Prostate,  Taylor  Prostate  3,  Varambally  Prostate 
and  Yu  Prostate).  We  also  incorporated  our  own  concept  signatures  for  metas¬ 
tasis,  advanced  Gleason  score  and  localized  cancer  determined  from  our  RNA- 
seq  data.  Eor  each  concept,  we  downloaded  the  gene  signatures  corresponding 
to  the  top  5%  of  genes  up-  and  downregulated.  Pairwise  signature  compari¬ 
sons  were  performed  using  a  one-sided  Eisher’s  exact  test.  We  controlled  for 
multiple-hypothesis  testing  using  the  ‘qvalue’  package  in  R.  We  considered 
concept  pairs  with  q  <  0.01  and  odds  ratio  >  2.0  as  significant.  In  cases  where 
a  gene  signature  associated  with  both  the  over-  and  underexpression  gene 
sets  from  a  single  concept,  only  the  most  significant  result  (as  determined  by 
odds  ratio)  is  shown. 

Analysis  of  SChLAPl  and  SMARCBl  expression  signatures.  Gene  signatures 
obtained  with  knockdown  of  SChLAPl  and  SMARCBl  were  generated  from 
Agilent  gene  expression  microarray  data  sets.  Eor  each  cell  line,  we  obtained 
a  single  vector  of  per-gene  fold  changes  by  averaging  technical  replicates  and 
then  taking  the  median  across  biological  replicates.  We  merged  the  results 
from  individual  cell  line  using  the  median  of  the  changes  in  22Rvl  and  LNCaP 
cells.  Venn  diagram  plots  were  produced  using  the  Bio  Venn  website^®.  We  then 
compared  the  top  10%  of  upregulated  and  downregulated  genes  with  knock¬ 
down  of  SChLAPl  and  SMARCBl  to  gene  signatures  downloaded  from  the 
Taylor  Prostate  3  data  set  in  the  Oncomine  database.  We  performed  signature 
comparison  using  one-sided  Eisher’s  exact  tests  and  controlled  for  multiple 
testing  using  the  R  package  ‘qvalue’.  Signature  comparisons  with  q  <  0.05  were 
considered  significantly  enriched.  We  plotted  the  odds  ratios  from  significant 
comparisons  using  the  ‘heatmap.2’  function  in  the  ‘gplots’  R  package. 

Kaplan-Meier  survival  analysis  based  on  the  SChLAPl  gene  signature. 

We  downloaded  prostate  cancer  expression  profiling  data  and  clinical 
annotations  from  GSE8402,  published  by  Setlur  et  al^^.  We  intersected  the 
253 -gene  SChLAPl  signature  with  the  genes  in  this  data  set  and  found  80  genes 
in  common.  We  then  assigned  SChLAPl  expression  scores  to  each  patient 
sample  in  the  cohort  using  the  unweighted  sum  of  standardized  expression 
levels  across  the  80  genes.  Given  that  we  observed  SChLAPl  expression  in 
approximately  20%  of  prostate  cancer  samples,  we  used  the  80th  percentile  of 
SChLAPl  expression  scores  as  the  threshold  for  ‘high’  versus  ‘low’  scores.  We 
then  performed  10-year  survival  analysis  using  the  ‘survival’  package  in  R  and 
computed  statistical  significance  using  the  log-rank  test. 

Additionally,  we  imported  the  253 -gene  SChLAPl  signature  into  Oncomine 
to  download  the  expression  data  for  167  of  the  253  genes  profiled  by  the 
Glinsky  prostate  data  set^^.  We  assigned  SChLAPl  expression  scores  in  a  simi¬ 
lar  fashion  and  designated  the  top  20%  of  patients  as  having  ‘high’  SChLAPl 
scores.  We  performed  survival  analysis  using  the  time  to  biochemical  pros¬ 
tate-specific  antigen  (PSA)  recurrence  and  computed  statistical  significance 
as  described  above. 

PhyloCSF  analysis.  We  obtained  46-way  multi- alignment  EASTA  files  for 
SChLAPl,  HOTAIR,  GAPDH  and  ACTB  using  the  ‘Stitch  Gene  blocks’  tool 
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within  the  Galaxy  bioinformatics  framework.  We  evaluated  each  gene  for 
the  likelihood  that  it  represented  a  protein-coding  region  using  PhyloCSF 
software  (version  released  28  October  2012).  Each  gene  was  evaluated  using 
the  phylogeny  from  29  mammals  (available  by  default  within  PhyloCSF)  in 
any  of  the  3  reading  frames.  Scores  are  measured  in  decibans  and  represent  the 
likelihood  ratio  that  a  sequence  is  protein-coding  rather  than  noncoding. 

Mayo  Clinic  cohort  analyses.  Subjects  were  selected  from  a  cohort  of  individ¬ 
uals  from  the  Mayo  Clinic  with  high-risk  prostate  cancer  who  had  undergone 
radical  prostatectomy.  The  cohort  was  defined  as  1,010  men  with  high-risk 
prostate  cancer  who  underwent  radical  prostatectomy  between  2000  and  2006, 
of  whom  73  developed  clinical  progression  (defined  as  individuals  with  sys¬ 
temic  disease  as  evidenced  by  positive  bone  or  computed  tomography  (CT) 
scan)^k  High  risk  of  recurrence  was  defined  by  preoperative  PSA  levels  of 
>20  ng/ml,  pathological  Gleason  score  of  8-10,  seminal  vesicle  invasion  (SVI) 
or  Gleason,  PSA,  seminal  vesicle  and  margin  (GPSM)  score  of  >10  (ref.  62). 
The  subcohort  incorporated  all  73  subjects  with  clinical  progression  to  sys¬ 
temic  disease  and  a  random  sampling  of  20%  of  the  entire  cohort  (202  men, 
including  19  with  clinical  progression).  The  total  case-cohort  study  included 
256  subjects,  and  tissue  specimens  were  available  from  235  subjects.  The 
subcohort  was  previously  used  to  validate  a  genomic  classifier  for  predicting 
clinical  progression^  k 

Tissue  preparation.  Formalin-fixed,  paraffin-embedded  samples  of  human 
prostate  adenocarcinoma  prostatectomies  were  collected  from  subjects  with 
informed  consent  at  the  Mayo  Clinic  according  to  an  IRB-approved  protocol. 
Pathological  review  of  tissue  sections  stained  with  hematoxylin  and  eosin 
was  used  to  guide  macrodissection  of  the  tumor  from  surrounding  stromal 
tissue  in  three  to  four  10-pm  sections.  The  index  lesion  was  considered  as  the 
dominant  lesion  by  size. 

RNA  extraction  and  microarray  hybridization.  For  the  validation  cohort, 
total  RNA  was  extracted  and  purified  using  a  modified  protocol  for  the  com¬ 
mercially  available  RNeasy  FFPE  nucleic  acid  extraction  kit  (Qiagen).  RNA 
concentrations  were  calculated  using  a  Nanodrop  ND-1000  spectrophotometer 
(Nanodrop  Technologies).  Purified  total  RNA  was  subjected  to  whole-tran- 
scriptome  amplification  using  the  WT-Ovation  FFPE  system  according  to  the 
manufacturer’s  recommendation  with  minor  modifications  (NuGen).  For  the 
validation,  only  the  Ovation  FFPE  WTA  System  was  used.  Amplified  products 
were  fragmented  and  labeled  using  the  Encore  Biotin  Module  (NuGen)  and 
hybridized  to  Affymetrix  Human  Exon  (HuEx)  1.0  ST  GeneChips  following 
the  manufacturer’s  recommendations. 

Microarray  expression  analysis.  Normalization  and  summarization  of  the 
microarray  samples  was  performed  with  the  frozen  Robust  Multiarray  Average 
(fRMA)  algorithm  using  custom  frozen  vectors.  These  custom  vectors  were 
created  using  the  vector  creation  methods  described  previously^^.  Quantile 
normalization  and  robust  weighted  average  methods  were  used  for  normaliza¬ 
tion  and  summarization,  respectively,  as  implemented  in  fRMA. 

Statistical  analysis.  Given  the  exon-intron  structure  of  isoform  1  of  SChLAPl, 
all  probe  selection  regions  (or  PSRs)  that  fell  within  the  genomic  span 
of  SChLAPl  were  inspected  for  overlap  with  any  of  the  exons  of  this  gene. 
One  PSR,  2518129,  was  found  to  be  fully  nested  within  exon  3  of  SChLAPl 


and  was  used  for  further  analysis  as  a  representative  PSR  for  this  gene.  The 
PAM  (Partition  Around  Medoids)  unsupervised  clustering  method  was  used 
on  the  expression  values  of  all  clinical  samples  to  define  two  groups  with  high 
and  low  expression  of  SChLAPl. 

Statistical  analysis  on  the  association  of  SChLAPl  with  clinical  outcomes 
was  carried  out  using  three  endpoints:  (i)  biochemical  recurrence,  defined 
as  two  consecutive  increases  in  serum  PSA  of  >0.2  ng/ml  after  radical 
prostatectomy;  (ii)  clinical  progression,  defined  as  a  positive  CT  or  bone  scan; 
and  (hi)  prostate  cancer-specific  mortality. 

For  the  clinical  progression  end  point,  all  subjects  with  clinical  progression 
were  included  in  the  survival  analysis,  whereas  controls  in  the  sub  cohort  were 
weighted  in  a  fivefold  manner  to  be  representative  of  individuals  from  the 
original  cohort.  For  the  prostate  cancer-specific  mortality  end  point,  cases 
who  did  not  die  from  prostate  cancer  were  omitted,  and  weighting  was  applied 
in  a  similar  manner.  For  biochemical  recurrence,  because  the  case  cohort 
was  designed  on  the  basis  of  the  clinical  progression  end  point,  resampling 
of  subjects  with  biochemical  recurrence  and  the  subcohort  was  performed  to 
have  a  representative  of  the  selected  individuals  with  biochemical  recurrence 
from  the  original  cohort. 
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ABSTRACT: 

Long  noncoding  RNAs  (IncRNAs)  are  increasingly  implicated  in  cancer  biology, 
contributing  to  essential  cancer  cell  functions  such  as  proliferation,  invasion,  and 
metastasis.  In  prostate  cancer,  several  IncRNAs  have  been  nominated  as  critical  actors 
in  disease  pathogenesis.  Among  these,  expression  of  PCGEMl  and  PRNCRl  has  been 
identified  as  a  possible  component  in  disease  progression  through  the  coordination  of 
androgen  receptor  (AR)  signaling  (Yang  et  al..  Nature  2013,  see  ref.  [1]).  However, 
concerns  regarding  the  robustness  of  these  findings  have  been  suggested.  Here, 
we  sought  to  evaluate  whether  PCGEMl  and  PRNCRl  are  associated  with  prostate 
cancer.  Through  a  comprehensive  analysis  of  RNA-sequencing  data  (RNA-seq),  we  find 
evidence  that  PCGEMl  but  not  PRNCRl  is  associated  with  prostate  cancer.  We  employ 
a  large  cohort  of  >230  high-risk  prostate  cancer  patients  with  long-term  outcomes 
data  to  show  that,  in  contrast  to  prior  reports,  neither  gene  is  associated  with  poor 
patient  outcomes.  We  further  observe  no  evidence  that  PCGEMl  nor  PRNCRl  interact 
with  AR,  and  neither  gene  is  a  component  of  AR  signaling.  Thus,  we  conclusively 
demonstrate  that  PCGEMl  and  PRNCRl  are  not  prognostic  IncRNAs  in  prostate  cancer 
and  we  refute  suggestions  that  these  IncRNAs  interact  in  AR  signaling. 


INTRODUCTION 

Long  noncoding  RNAs  (IncRNAs)  have  emerged 
as  a  critical  element  in  cell  biology,  contributing  to  a 
wide  variety  of  cellular  behaviors  and  functions  [2].  In 
cancer,  IncRNAs  have  been  the  subject  of  much  research 
during  the  past  five  years.  Notably,  IncRNAs  are  known 
to  coordinate  aggressive  phenotypes  of  several  common 


tumors,  including  breast  cancer  and  prostate  cancer  [3,4]. 
Large  profiling  studies  have  suggested  that  upwards  of 
10,000  IncRNAs  may  exist  in  the  human  genome  [5];  yet 
only  a  fraction  of  these  entities  have  been  characterized. 
Thus,  the  identity  and  function  of  IncRNAs  in  cancer 
remains  largely  unknown. 

In  prostate  cancer,  several  IncRNAs,  including 
PCA3  and  PCAT-1,  have  been  shown  to  be  upregulated 
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in  patients  with  cancer  [6-9].  Recently,  two  IncRNAs, 
PCGEMl  and  PRNCRl,  have  been  suggested  in  prostate 
cancer  to  act  as  mediators  of  castration-resistance  disease 
by  binding,  in  a  direct  and  sequential  fashion,  to  the 
androgen  receptor  (AR),  causing  ligand-independent 
activation  of  its  gene  expression  programs  [1].  While 
PCGEMl  has  been  observed  in  prostate  cancer  previously 
[6,  10],  PRNCRl  is  a  poorly  characterized  transcript,  and 
we  were  concerned  that  PRNCRl  had  not  been  nominated 
by  previous  global  profiling  studies  of  prostate  cancers  [7, 
11-14]. 

We  therefore  sought  to  investigate  PRNCRl  and 
PCGEMl  in  prostate  cancer.  In  specific,  we  sought  to 
reproduce  three  core  observations  suggested  by  Yang  et  al 
published  in  Nature  (see  [1])  and  include:  1)  thsit  PRNCRl 
and  PCGEMl  are  highly  overexpressed  in  aggressive 
forms  of  prostate  cancer,  2)  that  these  two  IncRNAs  bind 
to  AR  under  ligand- stimulated  conditions,  and  3)  that  the 
coordination  of  PRNCRl  and  PCGEMl  interact  with  AR 
via  specific  post-translational  modifications  of  the  AR 
protein.  Here,  we  report  that  none  of  these  three  findings 
is  fully  reproducible. 

First,  we  asked  whether  PCGEMl  and  PRNCRl 
are  highly  overexpressed  in  aggressive  prostate  cancer, 
as  suggested  by  others  (see  [1,  15]).  Indeed,  while  some 
have  argued  that  these  IncRNAs  are  critical  in  castration- 
resistant  prostate  cancer  [1],  there  has  been  no  study 
that  evaluated  the  expression  of  these  IncRNAs  in  tissue 
samples  from  human  castrate-resistant  prostate  cancers 
(CRPC).  To  evaluate  these  IncRNAs  in  more  detail, 
we  first  assessed  their  expression  levels  in  171  human 
prostatic  tissues  using  RNA  sequencing  data  aggregated 
from  four  independent  studies  of  prostate  cancer,  including 
our  own  internal  datasets  [1,  12-14]  (Fig.  lA).  Whereas 
we  found  robust  expression  of  PCGEMl  in  a  subset  of 
prostate  tissues  (RPKM  >1  in  82  samples;  RPKM  >10 
in  27  samples),  we  observed  scant  levels  of  PRNCRl  in 
all  samples  (RPKM  >1  in  only  3  samples;  RPKM  >10 
in  0  samples)  (Supplementary  Table  1).  This  does  not 
lend  confidence  to  PRNCRl  as  a  significant  entity  in  this 
disease.  For  comparison,  we  used  the  prostate  cancer 
IncRNA  SChLAPl  as  a  positive  control.  We  found  extreme 
overexpression  of  SChLAPl  in  samples  from  all  datasets 
(RPKM  >1  in  69  samples;  RPKM  >10  in  26  samples) 
(Supplementary  Fig.  1).  To  rule  out  the  possibility  that 
PRNCRl  was  a  non-poly-adenylated  RNA,  we  verified 
experimentally  that  PRNCRl  was  observed  in  the  poly- A 
fraction  of  RNA  that  was  used  to  generate  the  RNA-seq 
data  (Supplementary  Fig.  2). 

Given  the  low  support  for  PRNCRl  in  the  RNA-seq 
data,  we  next  confirmed  these  findings  using  quantitative 
PCR  (qPCR)  in  a  large  set  of  prostate  cancer  tissues 
including  34  PCAs  and  31  CRPC  tumors  as  well  as  18 
benign  adjacent  tissues.  As  shown  in  Fig.  IB,  PCGEMl 
is  upregulated  in  clinically  localized  cancer,  confirming 
the  known  literature  [6,  10];  however  PR/VCR 7  expression 


does  not  demonstrate  a  convincing  association  with 
prostate  cancer.  We  found  a  borderline  decrease  in 
PRNCRl  expression  in  metastatic  castrate-resistant  cancer 
(p  =  0.047,  Student’s  t-test).  We  used  PCATl,  EZH2,  and 
SChLAPl  as  control  genes,  all  of  which  have  elevated 
expression  in  prostate  cancer  metastases.  Conversely, 
we  used  PC  A3  as  a  control  gene  that  is  known  to  be 
upregulated  in  localized  prostate  cancer  but  not  metastatic 
prostate  cancer.  Finally,  while  PCGEMl  is  upregulated 
in  cancer  patients  from  matched  tumor/benign  samples, 
PRNCRl  does  not  convincingly  exhibit  this  pattern  of 
upregulation  (Supplementary  Fig.  3). 

Next,  an  independent  analysis  of  235  high- 
risk  prostate  cancer  tissues  demonstrated  that  neither 
PCGEMl  nor  PRNCRl  is  associated  with  aggressive 
prostate  cancer,  and  neither  IncRNA  stratifies  prostate 
cancer-specific  mortality  (Fig.  1C,D  and  Supplementary 
Tables  2,3).  An  analysis  of  intermediate  endpoints  such 
as  biochemical  recurrence  and  progression  to  metastatic 
disease  demonstrated  a  trend  for  PCGEMl  and  PRNCRl 
to  be  associated  with  less  aggressive  disease  and  favorable 
outcomes  (Supplementary  Fig.  4),  which  contradicts 
previous  claims  that  these  IncRNAs  are  involved  in 
an  aggressive  patient  clinical  course  [1,  15].  Using  an 
independent  validation  cohort  of  tissues  we  verified 
that  neither  PCGEMl  nor  PRNCRl  is  associated  with 
aggressive  prostate  cancer  (Supplementary  Table  2). 
By  contrast,  we  have  used  these  datasets  to  confirm  the 
prognostic  utility  of  the  IncRNA  SChLAPl  in  prostate 
cancer,  and  high  expression  of  SChLAPl  is  a  powerful 
predictor  for  poor  patient  survival  (Fig.  IE)  [4]. 

Next,  we  examined  whether  PCGEMl  and  PRNCRl 
interacted  with  AR.  We  performed  RNA-IP  (RIP)  assays 
using  two  independent  AR  antibodies,  including  the  same 
antibody  that  was  previously  used  to  show  an  interaction 
between  these  IncRNAs  and  AR  [1].  In  accordance  with 
the  published  literature,  we  performed  a  time-series  of 
RIP  experiments  following  AR  stimulation,  because  prior 
data  suggests  that  these  IncRNAs  bind  AR  from  1-2  hours 
after  AR  stimulation  but  not  at  4  hours  post-stimulation 
[1].  In  our  RIP  experiments,  we  could  not  confirm  that  AR 
binds  to  PCGEMl  or  PRNCRl  at  either  1  hour  or  4  hours 
post-stimulation  with  DHT  (Fig.  IF  and  Supplementary 
Fig.  5).  Similarly,  in  cells  grown  at  steady-state,  we  used  a 
second  AR  antibody  and  did  not  observe  binding  between 
AR  and  PCGEMl  or  PRNCRl  (Supplementary  Fig.  6). 
DHT-stimulated  cells  also  demonstrated  no  induction  in 
PCGEMl  ox  PRNCRl  expression  (Supplementary  Fig.  7). 
These  results  imply  that  PCGEMl  and  PRNCRl  are  not 
AR- interacting  IncRNAs. 

Finally,  earlier  data  propose  that  PCGEMl  and 
PRNCRl  interact  with  AR  via  specific  post-translational 
modifications  (PTMs),  specifically  K349  methylation 
(K349Me)  for  PCGEMl  and  K631/K634  acetylation 
(K631Ac/K634Ac)  for  7^7^AC7^7  [1].  To  search  for  these 
PTMs,  we  independently  performed  mass  spectrometry  for 
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Figure  1:  PCGEMl  and  PRNCRl  are  not  associated  with  prostate  cancer  progression  and  do  not  bind  the  androgen 
receptor.  (A)  Plot  showing  PCGEMl  (grey  bars)  and  PRNCRl  (red  eireles)  expression  levels  (Reads  per  Kilobase  per  Million  Reads,  or 
RPKM)  aeross  171  samples  from  four  RNA-Seq  studies  of  prostate  eaneer:  Miehigan  Center  for  Translational  Pathology  (MCTP,  internal 
data  and  dbGAP,  phs000443.vLpl),  Ren  et  al.  [13]  (EGA,  ERP00550),  Kannan  et  al.  [14]  (GEO,  GSE22260),  and  Pflueger  et  al.  [12] 
(dbGAP,  phs000310.vl.pl).  Inset  box  shows  deseriptive  statisties  for  eaeh  study.  (B)  Quantitative  PCR  for  PCGEMl  and  PRNCRl  in  a 
eohort  of  prostate  eaneer  tissues,  benign  (n  =  18),  loealized  eaneer  (n  =34),  metastatie  eaneer  (n  =  31).  An  asterisk  (*)  indieates  p  <  0.05. 
Two  asterisks  (**)  indieate  p  <  0.01.  n.s.  =  non-signifieant.  P  values  were  determined  by  a  two-tailed  Student’s  t-test.  Data  for  SChLAPl 
is  obtained  and  re-analyzed  from  a  prior  publieation  (ref  [4]).  (C)  PCGEMl  expression  does  not  prediet  for  prostate  eaneer-speeifie 
mortality  (PCSM).  (D)  PRNCRl  expression  does  not  prediet  for  PCSM.  (E)  High  SChLAPl  expression  is  a  powerful  predietor  of  PCSM  (p 
=  0.0022).  Data  in  (E)  is  reprodueed  from  a  prior  publieation  (ref  [4]).  P  values  in  (C-E)  are  determined  using  a  log-rank  test.  (F)  RNA- 
immunopreeipitation  (RIP)  for  AR  following  stimulation  ofENCaP  eells  with  lOOnM  DHT  does  not  show  binding  of  PRNCRl  ov  PCGEMl 
to  AR.  U1  binding  to  SNRNP70  is  used  as  a  positive  eontrol.  PCAT-1,  ANRIL,  andMALATl  serve  as  negative  eontrols.  Inset:  Western  blot 
eonfirmation  of  AR  protein  pull-down  by  the  immunopreeipitation  assays.  Error  bars  represent  S.E.M. 
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AR  in  the  LNCaP  cell  line,  achieving  95%  coverage  of  all 
possible  tryptic  peptides.  We  were  unable  to  confirm  that 
these  PTMs  (K349Me,  K631Ac,  or  K634Ac)  are  present 
on  AR  (Supplementary  Fig.  8  and  Supplementary  Table 
4).  To  examine  this  discrepancy  further,  we  re-analyzed 
prior  AR  MS  data  (found  in  [1]).  Although  this  MS  dataset 
was  obtained  with  a  trypsin  digestion  to  prepare  samples 
for  MS,  we  found  no  fully  tryptic  peptides  supporting 
the  nomination  of  K349Me,  K631Ac,  or  K634Ac 
(Supplementary  Fig.  8).  In  fact,  in  the  MS  data  for  these 
PTMs  in  ref.  [1],  almost  all  peptides  harboring  these 
PTMs  are  non-tryptic,  which  are  generally  considered  to 
be  analysis  artifacts  since  true  non-tryptic  peptides  are 
exceedingly  rare  following  a  trypsin  digestion  [16-18] 
(Supplementary  Discussion).  Non-tryptic  peptides  are 
also  associated  with  a  high  false-discovery  rate  [19].  All 
peptides  nominating  the  K349Me,  K631Ac,  or  K634Ac 
PTMs  in  ref.  [1]  also  had  multiple  additional  PTMs  that 
were  nominated,  indicating  non-specificity.  These  included 
extraordinarily  rare  and  unusual  PTMs  such  as  oxidated 
lysine  and  deamidated  asparagine,  which  suggest  technical 
artifacts  given  the  negligible  likelihood  of  multiple  rare 
and  unusual  PTMs  occurring  on  true  non-trypic  peptides. 
The  statistical  confidence  for  these  non-tryptic  peptides 
is  <5%,  whereas  the  corresponding  fully  tryptic  peptides 
for  these  amino  acid  residues  had  statistical  confidences 
>90%. 

In  summary,  we  have  been  unable  to  show  a 
convincing  role  for  PCGEMl  or  PRNCRl  in  aggressive 
prostate  cancer  or  AR  signaling.  First,  our  data  analysis 
of  numerous  human  prostate  cancer  tissues  from  multiple 
independent  laboratories  indicates  that  neither  PCGEMl 
nor  PRNCRl  are  associated  with  castration-resistant 
prostate  cancer.  Second,  we  were  unable  to  verify  that 
PCGEMl  and  PRNCRl  bind  to  the  androgen  receptor. 
Lastly,  we  are  unconvinced  that  the  K349Me,  K631Ac, 
or  K634Ac  AR  PTMs  represent  a  plausible  mechanism 
for  interaction  between  AR  and  PCGEMl  and  PRNCRl. 
While  our  results  challenge  the  notion  that  PCGEMl  and 
PRNCRl  play  a  causal  role  in  prostate  cancer,  we  regard 
IncRNAs  as  an  emerging  field  of  study  in  cancer  [3,  6,  20, 
21]  and  we  are  encouraged  by  the  interest  in  IncRNAs  in 
prostate  cancer. 

METHODS 

Prostate  tissues  were  obtained  from  the  radical 
prostatectomy  series  and  Rapid  Autopsy  Program 
at  the  University  of  Michigan  tissue  core.  All  tissue 
samples  were  collected  with  informed  consent  under  an 
Institutional  Review  Board  (IRB)  approved  protocol  at 
the  University  of  Michigan.  Outcomes  analyses  were 
performed  on  a  cohort  of  Mayo  Clinic  prostate  cancer 
radical  prostatectomy  samples  obtained  under  an  IRB- 
approved  protocol  as  described  previously.  Cell  lines 
were  maintained  according  to  standard  conditions.  For 


RIP  experiments,  cells  were  deprived  of  androgen  for 
48  hours  prior  to  stimulation  with  lOOnM  DHT.  RIP 
experiments  were  performed  as  previously  described  [  1 , 
4].  Bioinformatics  analyses  utilized  publicly  available 
RNA-Seq  data.  Please  see  Supplementary  Methods  for 
details. 
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